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PREFACE 


For appropriate process and quality control in the cement and concrete industry, a large 
number of data are derived. Optimum benefit is, however, only achieved if these are 
adequately processed and interpreted. Statistics is one of the important means to make best 
use of the data be it by application of numerical methods and/or by graphical representation. 


The present handbook describes the relevant basic definitions, formulae and applications of 
statistical methods which are useful in the cement industry. Emphasis is put on adequate 
data description and graphical representation to ensure reasonable processing and 
interpretation of statistical data. Most of the described procedures are illustrated with 
practical examples. 

Chapters 1 and 2 are concerned with the basic ideas of statistics, the rules for the 
representation of a given set of data (graphical representation, numerical measures) and the 
treatment of outliers. 


Chapters 3 to 8 present some statistical methods, useful for decision making, 
experimentation and process control. 


Of special practical significance is the Application Section (chapter 5.2), which includes a 
procedure manual with a general check list and a collection of important test procedures. 
Chapter 9 gives an outwork to more sophisticated statistics. 


Appendix | contains a selection of practical examples to illustrate applicability and 
interpretation of the demonstrated methods. A useful work sheet to construct frequency 
tables and to check the data for normality is given in Appendix II. Further Appendices 
contain: the required statistical tables for the determination of confidence limits and the 
application of test procedures, a list of recommended literature and an index of examples 
used in the text. 


A subject index (English, German, French) of statistical terms is provided in Appendix VI. 


The copyright for this documentation is reserved by "Holderbank" Management and 
Consulting Ltd. The right to reproduce it entirely or in part in any form or by any means is 
subject to the authorisation of "Holderbank" Management and Consulting Ltd. 
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1. INTRODUCTION 


Statistics is concerned with methods for collecting, organising, summarising, presenting and 
analysing data, obtained by measurement, counting or enumeration. 


With descriptive statistics a given set of observations is summarised or presented to get a 
quick survey of the corresponding phenomena. 


In a more sophisticated analysis of representative samples statistical inference (inductive 
statistics) allows conclusions to be drawn about the entire population. 


A sample is considered to be representative only if it is drawn from the population at 
random. Such a group of n observations is called a sample of size n. 


Some main topics of interest in statistical analysis are: 


a) Location of the data: 
Where are the observations located on the numerical scale? This question leads to the 


use of central values as the mean or the median. 


b) Variability or dispersion: 
Problems concerning the degree to which data tend to spread about an average value. 


c) Correlation: 
Degree of dependence between paired measurements, e.g.: Is there a real dependence 


of mortar strength on the alkali content in clinker? 


d) Regression: 
Fitting lines and curves to express the relationship between variables in a mathematical 
form especially used for prediction and calibration. 


e) Splitting variability: 
Looking for the importance of several causes to the variability of observations. Variability 
arises due to different components of random errors. Special experiments must be 
designed to split these componenis. 


The use of statistical methods and the interpretation of statistical results requires a certain 
comprehension of variability. If ten pieces of coal from a delivery are analysed as to their 
water content, the results are not identical with the water content of the full quantity. Rather 
we have ten different results in a certain range. This variability of data arises not only from 
the fact that the pieces really have different water contents, but also because the results are 
influenced by three different types of errors which may occur in every set of observations: 


Random errors cannot be avoided. They are due to imprecise measurement, rounding of 
data, environmental effects and not identically repeatable preparation. The amount of 
random errors may be expressed by statistical measures of variability. 


Systematic errors lead to a bias in location, but not necessarily in dispersion. A bias in 
dispersion is obtained if several systematic errors are mixed. 


Example: Every laboratory assistant has usually his own systematic error. This error may be 
relevant or not and it may change with time. But we always expect a greater variability of the 
observations if more than one person have performed the measurements. On the other 
hand, we recognise in this example that a small variability does not necessarily lead to a 
better average value. Environmental effects may be systematic too (e.g. air humidity). 


ae 
Page 440 © Holderbank Management & Consulting, 2000 


: 
“Hoiderbank” Cement Seminar 2000 


Materials Technology Il - Statistics 


Gross errors are "wrong" values in the set of observations. Experience shows that 5% to 
10% of gross errors have to be expected in a data set. Reasons may be: wrong reading of 
scales, errors in copying data, data not legible, miscalculations, gross error in measurement. 
Gross errors have a considerable influence on statistical results. 


Careful measurement and data handling is, therefore, important. Data have to be inspected 
for outliers before a statistical analysis is performed. 


Conclusion: The deviation of the single results from the true mean of the full quantity 


originate from real differences between the samples on one hand, and from the occurrence 
of several types of errors on the other. 





It is often stated that “everything can be proved with statistics", this reasoning clearly is 
wrong; false or often only misunderstood statistics originate from insufficient 
representations, application of wrong procedures or assumptions, or misleading 
interpretation of the results. Especially graphical representations can easily be manipulated. 
It is therefore essential that the applicant of statistical techniques knows what he can and 
what he cannot do! An amusing booklet dealing with such statistical "lies" is "How to Tell the 
Liars from the Statisticians". 


In our days of growing computer use, a lot of powerful (and sometimes less powerful or 
even poor) statistical software packages are available, leading to extensive use of statistics 
procedures by nonstatisticians or people not having enough statistical background. These 
packages manage nearly any instruction without being able to decide whether the statistical 
procedure is appropriate. This bears a great danger of use of statistics by statistical 
amateurs or ignorants. It is therefore absolutely necessary that the user of statistics has a 
solid statistical education. 


2. SIMPLE DATA DESCRIPTION 


In this section some descriptive methods are presented to get a quick survey of the data. 
The methods are illustrated with an example of concrete strength. 


Example 1 
The following data represent the compressive strength of 90 cubes (20 x 20 x 20 cm) of 


concrete. The data listed in chronological order of measurement is called set of 
observations. 


35.8 39.2 36.8 32.4 30.7 30.8 23.5 22.8 23.7 31.7 346 27.6 29.9 28.4 29.3 
33.0 37.6 38.1 33.3 38.9 37.1 33.3 33.4 36.4 443 48.9 40.1 43.4 35.4 36.6 
32.8 34.1 37.4 27.9 30.2 32.0 453 45.8 41.0 26.1 27.9 244 35.3 34.5 36.1 
30.1 40.2 37.9 25.0 23.0 27.8 33.5 34.2 30.0 29.0 35.2 35.8 23.9 34.9 31.5 
35.9 39.7 39.4 324 33.6 35.2 32.8 30.2 31.6 285 285 30.3 31.4 31.8 35.5 
27.1 24.5 20.9 24.6 27.2 31.7 32.2 38.6 32.8 37.8 368 35.3 41.9 34.4 35.5 


Generally a set of n observations is denoted by x,, Xo, ..., Xn where the index j of x; 
corresponds to the number of the observation in the set. 
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2.1 Graphical Representation 

Although the set of observations gives the complete information about the measurements, it 
is little informative for the reader. A better survey is obtained by grouping the data in classes 
of equal length, presented in a frequency table with tally. 

Rules for classification: 

1) The mid points should be impressive values with a few number of digits. 

2) Choose 8 to 20 classes of equal length (approx. Vn; classes) 

3) Class boundaries should not coincide with observed values (if possible) 


absolute relative 

frequenc frequenc 

| 90.0 | 1875-2125 1/1 °° &#2z&| 1 ~~ ~[ ~~ oon 

| 225 | 2125-2375 |m | 4 | 0044 
| 25.0 | 23.75-26.25 |mi =| =e S| Se 
| 27.5 | 2625-2875 {mm | 9 | 0100 | 
| 30.0 | 28.75-31.25 |mw | 10 |  o1i | 
[a25 | 3125-3375 |mmmm | 19 | oo | 
| 35.0 | 3375-3625 |mwmn | 17 | o189 | 
| 375 | 3625-3875 [mun | 41S | tee 
(40.0 | 3875-4125 [wn | 7 -| 0.078 
| 42.5 | 4125-4375 |m =| 2 | one 
| 45.0 | 43.75-4025 |p | | 
| 475 | 4625-4875 | | o | 0000 | 
| 50.0 | 4e75-s125 [1 | 1 | oon | 
| tots | CE CE Tt 00S 


In this representation we recognise directly a minimum of 20.0 N/mm’, a maximum of 50.0 
N/mm”, and an average value between 32.5 and 35.0. The measurements are distributed 
symmetrically about the average value. 


The graphical representation of the tally with rectangles is called a histogram. 
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Fig. 1 Histogram. 
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X = 33,2 N/mm2 


X # 33,5 N/mm? 


The area of the rectangles must be proportional to the tally not the height of the bar, e.g. if 
two classes contain the same number of observations and the width of class 1 is twice the 
one of class 2, then the height in class 1 is half the height in class 2. Same number of 
observations same area of rectangle! 

Two further informative graphs are the frequency curve and the cumulative frequency curve. 
They are constructed as follows: 


Frequency curve 


¢ plot the class frequency (absolute or relative) against the class midpoint (again take into 
account the note for the histogram, same number = same area). 


© Holderbank Management & Consulting, 2000 Page 443 


; 
“Holderbank” Cement Seminar 2000 


Materials Technology II - Statistics 


Fig. 2 Frequency Curve. 
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Cumulative frequency curve 
# cumulate the absolute frequencies less than the upper class boundary for every class 


¢ plot the cumulative frequency in percent against the upper class boundary. 


Fig. 3 Cumulative Frequency Curve. 
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From this graph we can determine the portion of observations smaller than any given 
strength value x. 


Example: Percentage of the observations smaller than 30.0 N/mm: 26 %. Inverse 
problem: Half of the measurements are smaller (resp. greater) than 33.5 N/mm’. 


These two frequency curves are especially suited to compare two or more different 
distributions with one another. 


Fig. 4 Compare Distributions. 
% % 
100 
0 
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Stem and Leaf Plot 


The stem and leaf plot is very similar to the histogram but allows to reconstruct the individual 
data. [The stem and leaf plot is shown in the next picture, the comment does not belong to 
it, but shows how the data are reconstructed. ] 


COMMENT 

20.9 

no observation 

22.8 

23.0, 23.5, 23.7, 23.9 
24.4, 24.5, 24.6 


etc. 
126899 
455 
039 
0122378 
456778 
0244888 
033456 
124569 
2233455889 
14688 
14689 
169 
247 
1 
029 
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Small Samples 
Individual values are marked on the scale 


Fig. 5 Small Sample Table. 


A further attractive possibility to describe graphically the distribution of a variable, the Box 
Plot, will be given in the next section. Often it is appropriate (especially in quality control) to 
plot the observations in chronological order to show a possible change of the level during 
the experiment. 


Fig. 6 Chronological Order. 





10 


5 30 45 60 75 80 Nr. 


lf only few data are available, the single values may be represented as points on the 
measurement scale (cf. example A1, Appendix 1). 


The scatter diagram or scatter plot is used to represent the relationship between two 
variables (paired observations concerning the same individual sample). The following 
diagrams show the dependence of concrete strength in N/mm? from cement/water ratio after 
2 days and 28 days. Different symbols can be used to discriminate groups of observations. 
In the present case two groups are considered: Portland cement (PC) and blended Portland 


cement (BPC). 
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Fig. 7 Scatter Diagram. 
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If we are dealing with more than 2 variables a suggested graphical representation is the 
draftmansplot, which plots all pairwise scattergrams in one picture. 
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X] x2 X32 x4 
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2.2 Statistical measures 


As statistics deals with the behaviour of random variables, i.e. variables which take certain 
values with certain probabilities, we are interested in describing this behaviour of the random 
variable by a few characteristic measures. The first measures we will be considering are the 
p quantiles (or p fractiles), which are roughly said those values for which 100.p % 

(0 <= p<=1) of the data are smailer or equal to. The p quantiles allow a full description of 
the data, similar e.g. to the cumulative frequency curve from Section 2.1. The use of the p 
quantiles gives way to construct a simple and attractive graphical representation of the data, 
the box plot. 


2.2.1 The p-quantile 
Xp is called the p-quantile (or p-fractile) if 


100 © p%(0 <= p<=1) 





of the values of the random variable are smaller or equal to Xp. 


Based on a set of observations from a random variable x, a first method to estimate these 
quantiles is given by the cumulative frequency curve. 


100-p 


0 X 
Xp 
Given a sample x(i), i=1,..,n the empirical cumulative distribution function (cdf) is defined as 
F(x): = (number of the x(i)’s smaller or equal to x) /n 
and the following relations between the p quantile y(p) and the cdf F(x) hold 
a) F(y(p))=p . 
b) Y(p) = min {x(i) : F(x(i)) >= p} 


The p quantile y(p) is roughly said the value for which 100*p% of the observed data are 
smaller or equal to. 
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Estimation of P quantile v(P) 


Denote by z(i), i=l, ...n the ordered sequence of the x(i)’s, e.g. 
2(1) <= 2(2) <= 2(3) <=....’=z(n 1) = z(n) 
and let p(i) = (i 0.5)/n. 
Then the P quantile can be computed as follows: 
1) If P equals a p(i), then z(i) is the P quantile 
Y(P) = 2(i) 
2) Otherwise compute 
j(P) =nP+0.5 


and split this value j(P) into a whole number j and the remaining part B (e.g. 1.347 is split 
into j=1 and B=0.347). The P quantile is then 


y(P)A1— B)z( j)}+B2( j +1) 


An example 
Consider the following ordered sample 


i Z(I i 

1 156 0.0417 
2 158 0.1250 
3 159 0.2083 
4 160 0.2917 
5 161 0.3750 
6 161 0.4583 
7 163 0.5417 
8 166 0.6250 
9 166 0.7083 
10 168 0.7917 
11 172 0.8750 
12 174 0.9583 


12.5 % quantile (P=0.125) 


As P=p(2), our 12.5 %-quantile y(.125) equals the second ordered observation, thus y(.125) 
= 158 


10 % quantile (P=O.1 
Compute j(P)=nP + 0.5 = 12%0.1 + 0.5 = 1.7 
As this yields no whole number our 10 % quantile is (j=1 and B=0.7) 
y(.1) = (1 B) 2(1) + B 2(2) 
= 0.3*156 + 0.7*158 = 157.4 
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25 %-quantile (P=0.25) 
Compute j(P)=nP + 0.5 = 12*0.25 + 0.5=3.5 


As this yields no whole number our 10 % quantile is (=3 and B=0.5) 
y(.25) = 0.5*159 + 0.5*160 = 159.5 
50 %_ quantile (P=0.5 
Compute j(P)=nP + 0.5 = 12°0.5+0.5=6.5 
As this yields no whole number our 10 % quantile is (j=6 and B=0.5) 
y(.5) = 0.5*161 + 0.5*163 = 162 


Some of the quantiles have special names, e.g. 


50 % quantile median (M) 

25 % quantile lower quantile (LQ) 

75 % quantile upper quantile (UP) 

10 % quantile lower decile (LD) 

90 % quantile upper decile (UD) 

OQ UQ Inter quantile range (!Q) 


2.2.2 The Box Plot 


Today the box-plot is perhaps the most frequently used graphical representation for 
univariate data. It offers a condensed picture of the data’s distribution and shows location, 
variability and extremes of the data. 


in order to construct the box plot we need three quartiles: 
¢ the median 

@ the lower quantile 

@ the upper quantile 

The construction is very simple: 


XXX 


L LQ M UQ U 


where L=max (LQ 1.5x1Q, min xi) 
U = min (UQ + 1.5 x IQ, max x)) 
IQ = UQ 1lQ 
x denotes extreme values. 
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This construction is based on the fact that for a normal distribution approximately 1 % of the 
data lie outside the interval (L,U). 

An alternative method also frequently used, would be not to use L and U but instead use the 
lower and upper decile. Especially for non normal distributions this representation seems 
preferable. 

2.2.3 Measures of location 

The arithmetic mean is the mean of all observations and corresponds to the centre of gravity 


in physics. 


x= 


X= 35.8 +39.24+36.84...+434.4+35.5 = 33.199N/ mm? 


90 


The median is the central value. Half of the observations are smaller, respectively greater 
than the median. 


Place the measured values in an ordered array starting with the smallest value 
X yy S Xe S---S Xn 


From this array we obtain the median by 


~ 


x< 


= Xn) ifn is odd 
2 


~_ 1 ent 
X=—|X, +X, ifn is even 
2| GG 


For a great number of observations the ordering is very laborious. In this case the median 
may be determined graphically from the cumulative frequency curve by looking for the point 
on the x axis corresponding to a cumulative frequency of 50 %. 


Mean or Median? 


lf the histogram is symmetrical, mean and median are approximately the same. In this case 
the mean is a better estimate of central tendency if the distribution is not too long tailed. 


If the histogram is skew, mean and median are different. 
@ Use the mean if you are interested in the centre of gravity or the sum of all observations 


¢ Use the median if you are interested in the centre, i.e. with equal probability a future 
observation will be smaller, resp. greater than the median. 
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Example 2 
The following histogram represents the income of 479 persons 
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In this extremely skew distribution the arithmetic mean is quite different from the median. 
Whereas the median represents a typical income, the mean can be used in a projection to 
estimate the total income of the population if the sample is drawn at random. 


The trimmed mean is used to estimate the mean of a symmetrical distribution if gross errors 
in the data are suspected. 


To calculate the a-trimmed mean the a-percent largest and smallest values are deleted. The 
trimmed mean is then the arithmetic mean of the remaining observations. Notation for the 
5%-trimmed mean: Xtimmed 0.05 


Usual values for a: 5 % to 10 % 


The weighted mean is used if certain weighting factors w, are associated with the 
observation x;. Reasons may be: 


- samples of unequal weights 


- observation are measured with unequal precision 
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Schematic survey on the use of measures of location: 
Type of distribution 


JN 


normal — shorttailed 


Measure to be used: arithmetic mean X 


longtailed, symmetrical 


Measure to be used: trimmed mean Xrinmeg 


_N 


extremly longtailed 


LN 


skew distribution 


Measure to be used: median X 





Measure to be used: median or arithmetic mean. The choice depends on the interest of the 
user. 
Remark: The median is equal to the 50%-trimmed mean. 
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2.2.4 Measures of Variability 


The variance is the mean square deviation of single observations from the mean. 


1< — 
s? =——_Y (x, - x)’ 
pays 





For practical computations use 
1 1 
2 2 
3? = xX -—() x, 
Le FL | 
The standard deviation is the positive square root of the variance. 


s= |—-¥ (x, - x)? = 55.596N/ mm? 
n—-1 i=1 


The coefficient of variation is a relative measure of the variability (relative to the mean). 





in our example: 





Ss 
v=— 


Xx 


v is often used for comparing variabilities. It is especially useful if s increases proportionally 
with the mean xX, i.e. v = constant. 


Example of compressive strength: 


= 5:596 
33.199 


The range is the difference between the largest and the smallest value. 


P= Xen ~ Xe) = Xmax ~ Xr 


= 0.169 = 16.9% 





in 
Useful with small sample sizes. 


Note: The range increases in general with increasing sample size n (greater probability to 
get extreme values). 


If the variables under consideration are assumed to be normally distributed then 


the range can be used to provide an estimate of the standard deviation, according to 


s= Py where k=4/n for 3<n<10 
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For sample size n >10 divide the set of observations in random sub samples of size n’, 


calculate the arithmetic mean A of the range in these sub-samples, and estimate s 


according to 
~R 
s~ Fy 


using a k-value corresponding to n’. . For a first rough estimation the following k values may 
be used: 

k~4 forn = 20 

k~4.5  forn=50 

k~5 for n= 100 


The interquartile range is the difference between the upper and lower quartiles. 





Q= X75 — Xo6 


If the distribution is symmetric s = ax 


2.2.5 Statistical program packages 


There are a lot of statistical programs on PCs, microcomputers and mainframes that offer 
the possibility to compute these statistical measures including quartiles and even to plot the 
box plots. The perhaps most prominent under many others are SAS, SYSTAT/SYGRAPH, 
SPSS, BMDP, STATGRAPHICS. 


2.2.6 Interpretation of the standard deviation 


In the histogram of example 1 (chapter 2.1) frequency of observations decreases on both 
sides of the mean. The distribution law seems to be symmetrical. In the present case we 
consider the 90 measurements to be a random sample of a specific distribution model, the 
normal distribution (or Gaussian distribution). This type of distribution is represented by a 
symmetrical, bell shaped curve, and often observed in practical applications. 


The normal distribution is determined by the mean and the standard deviation. 





standard deviation 


X—s5 X X+s 


For this distribution about 68 % of the observation are expected in the interval mean +1 
standard deviation. 


© Holderbank Management & Consulting, 2000 Page 457 


“Holderbank” Cement Seminar 2000 GLEE EET 


Materials Technology II - Statistics 


If the distribution is not normal, the standard deviation allows no direct interpretation. In this 


case we can determine the interval that contains a certain portion of the observations 
graphically from the cumulative frequency curve. 





Another possibility to get an interpretation of the standard deviation in a skew distribution is 
“normalising” of the distribution by transformations. In the case of example 2 (chapter 2.2.3), 
the logarithms of incomes show approximately a normal distribution. So called variance- 
stabilising transformations are especially used to fulfil normality conditions in higher 
statistical analysis. 


For further information consult e.g. Natrella (1963). 
Further properties of the normal distribution are given in section 3. 


2.2.7 Outliers 
As mentioned in the introduction we have to expect about 5 % to 10 % gross errors in a set 


of observations. Most of them may not be recognisable in the region of all other values. 
Some may be extremely outlying values with an important influence on statistical results. 


In modern statistics robust methods are studied, which are not sensitive to a certain portion 
of gross errors, as for example the median or the trimmed mean. More sophisticated robust 
procedures are in general rather complicated. 

If classical measures as the standard deviation and the arithmetic mean are used, we have 
to check the data for outliers and to eliminate them from the set of observations (cf. example 
A2, Appendix 1). 


Note: If outliers are eliminated, they must be recorded separately in the report. 


To detect outliers, check the data plot (tally, histogram or chronological order) for suspicious 
values. If outliers are suspected and the data show a normal distribution, use the Dixon 
criterion for rejecting observations (n<26). 
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The Dixon Criterion 
Procedure 


1) Choose a, the probability or risk we are willing to take of rejecting an observation that 
really belongs in the group. 


2) If 
3<n<s7 Compute rio 
8<n<10 Compute r,, 
11<n<13 Compute ra; 
14<n<25 Compute fo. 
where rj is computed as follows 
r, lf X,,) is suspect If X, is suspect 


ho (Xa — Xn (Xm — Xi) (Xa) — Xy)/( Xm — Xn) 
hy (Xe = Xn (Xm— Xa) (XQ Xy/(Xo-y - Xn) 
Foy (Xen — Xn) YX — Xe) (Xa) — n)/( Xin) ~ Xy) 
lp (Xa ~— Xv) (Xm — Xa) (Xa ~ Xin) /( Xing) — Xn) 
3) Look up f_4/2 for the r, from Step (2), in Table A 2 
4) If > %_a:2 reject the suspect observation; otherwise, retain it. 


In the case of a sample size n>25 use the following procedure: 


1) Choose a, the probability or risk we are willing to take of rejecting an observation that 
really belongs to the group 


Xin) — X 
2) Calculate Z, — O(n) A) 
s 


3) Look for Z,,_, in Table A 3, Appendix II! 


i-a 


(4) If Z, > 2,4, reject the suspect observation, otherwise retain it. 


1-a 


Note: The presented outlier rejecting rules are only valid in normal distributions. In a skew 


distribution, the elimination of outliers is very dangerous and should be avoided. In this case 
the reason for the extreme observation must be known. 





A check for outliers is not necessary if the trimmed mean or the median is used and if we 
are only interested in a location measure. 


For further tests on outliers cf. "Wissenschaftliche Tabellen Geigy, Statistik”. 


The standard deviation is extremely sensitive to outliers. 


A check is therefore important, because it is not allowed to calculate a standard deviation 
from a trimmed set of observations. 
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3. THE NORMAL DISTRIBUTION (ND) 


As mentioned in section 2, the normal distribution is a theoretical model of a statistical 
universe (population), defined by the mean py and the standard deviation o. Graphically it is 
represented by a smooth, symmetric mean x and the standard deviation s of the samples 
are estimated for the true, but unknown values pu and o. 


By the standardisation formula 


xa# (respectively xx 





Z= if n is large) 





the ND is transformed in a normalised form with mean 0 and standard deviation 1. 
Formulae for the standard normal distribution: 
Density function (bell shaped curve): 
1 Z2 
f(z)= Te Oxl-F Ne < Z<0) 


Jon 


Especially of interest is the area under the curve (distribution function), which corresponds 
for every given value z to the probability of an observation to be smaller than z. 


#(z)= pee Joxp(-Z)ae-o(-2)= 1-02 


f(z) 


yy 


Numerical values for z and (z) are given in Table A 1, Appendix III. The probability to 
observe a measured value between two given limits T; and T, can be calculated as follows: 


1) transform the limits T, and T2 in a standardised form 


2) calculate the probability with help of Table A 1 by 
F(Z,,2)) = W(Z,)-9(Z,) 
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Interpretation of the standard deviation in a normal distribution: 


frequency 





“30 -20-1lo0 wu tlio +20 +30 


We expect about 95 % of the observation between ui-20 and +20 . More general we have: 


The statement that the values (of the population) lie between p+Z1.~2. and [t-Zi-02.. is right 
with the probability S = 1 - & and wrong with probability a. One sided or two sided regions 
may be considered. The corresponding z-values are taken from a table of the standard 
normal distribution. 


Usual percentiles for the statistical confidence S: 


a) Two sided, S = 1— 052,5,2, (a/2) 






a ‘/ 9 


¥ 


a 


(sim) sf 90_—sdfs [go 99.9 | 


b) One sided, S=1~@2,,2,,) 
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How to check normality? 


Before using the characteristics of the ND the validity of the model has to be checked. 


Method: Draw the cumulative frequency curve in the normal probability paper (Appendix II). 
Normality can be assumed if the resulting curve is approximately a straight line between 5% 
and 95%. 
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Example: Normal probability plot of compressive strength data 


99 


95 
90 


80 


70 
60 


50 
40 


30 
20 


cumulative frequency % 


10 





20 30 40 50 N/mm? 


Conclusion: The observations of compressive strength follow approximately a normal 
distribution. 


Short cut rule for rejecting normality: If no negative values are allowed in the observations 


and the coefficient of variation is greater than 30 %, the distribution is not normal. 
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4. CONFIDENCE LIMITS 


4.1 Confidence limits for the mean 


The arithmetic mean is an estimate for the true, unknown mean of the population of all 
possible observations. We may be interested in the precision of this estimation. Therefore 
we calculate a confidence interval that contains the true value with high probability 
(confidence level). 


a) If the distribution is normal: 


Two sided confidence interval at confidence level 1-a 
Ss 


(1-a/2;f) Jn 


with 

f =n-1 degrees of freedom 

ti-w24 given in Table A 4, Appendix III (Student t-Distribution) 

Interpretation: . 

With probability 1-a the true mean pu lies between the two confidence limits. 


For n>50 ty.w24 May be replaced by Z1-2 given on page 15 (corresponding to normal 
deviates of Table A 1, Appendix III). In our example of compressive strength we calculate 
the 95% confidence interval by replacing the t- by the z- value: 


= s = s 
X + bo o75-29- = = X £2975 = 
0.97589" 0975" [ 
5.6 
783.2 £1.96 = 33.2+1.16 


b) If the distribution is not normal: 
Approximate confidence intervals can be obtained by 


— Ss 
XE. a12-T— 


This approximation is derived from the central limit theorem and can be used if 


10s? 


N>—=3- 
x2 


c) Range method: 
In the case of a normal distribution, confidence limits may be calculated by use of the range 
instead of the standard deviation (often used in quality control for small sample sizes n). 


XtA_gioh 


Values for are given in Table A 6, Appendix III. 
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4.2 Confidence limits for the median 


Confidence limits for the median can be determined directly from the ordered array of 
observations. 


X11) < Xi) < oaee < X(n-1) < Xn) 


(1-a)-confidence interval: 


Xy <Median < X41) 


; n-1 2, : 
with k =——~—+2/2../n—1 rounded to the next lower integer value 


Z1-w2 is given in Table A 1, Appendix Ill. 


4.3. Confidence limits for the standard deviation 
If the population follows a normal distribution, two sided confidence limits are given by 








<o<s 


. x? 
1-al2:f alat 


with f =n 1 degrees of freedom 


X;_aietXe/24 Critical values of the chi-square distribution given in Table A 5, Appendix III 


Confidence intervals are reduced with increasing sample size n, i.e. the more observations 


available, the better is the estimation. 





The degree of improvement for the arithmetic mean can be derived from the fundamental 
central limit theorem. If several samples of size n are drawn from the same population, the 
arithmetic means of these samples are approximately normal with mean pw and standard 
deviation s/Vn, the so called standard error of the mean (SEM). 


For increasing n to infinity, the standard error of the mean tends to zero, i.e. the estimation 
tends to be absolutely precise if no systematic errors are present. 


4.4 Other methods for the construction of confidence limits 


Sometimes it happens that we have to deal with very complicated functions of random 
variables, for which we can’t derive or know the underlying distribution function, but we 
would like to have confidence limits for the values of this complicated function. A newer 
method, the Bootshap method (B. Efron, 1983), allows to obtain such results, but is rather 
intensive in calculation. 
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5. STANDARD TESTS 


5.1 General Test Idea 





Significant: 

We decide that a difference really exists. The probability of this decision to be false is known 
and can be chosen by the decision maker. A usual choice of this error probability is 5%. 

Not significant: 

The observed difference may be realized by chance alone. The data give no argument to 
suppose the existence of a real difference. If such a difference really exists the sample size 
n is too small to detect it. 

Example 3: 

At a cement plant A, 618 titration samples gave the mean value x, = 82.67 and the 
calculated standard deviations, = 4.13. 

At a later stage a further series of 525 samples were taken and after processing, yielded the 
information of calculated mean value x, = 85.58 and calculated standard deviation 

S, =3.79 


Your decision problem is: 
State whether a significant change has occurred under plant conditions! 


Example 4: 
The production rate of a cement mill in tons/hr was measured as: 
28.3 27.2 29.3 267 29.9 246 25.0 300 263 27.8 


After modification, the production rate of the same mill in tons/hr was measured as: 
28.0 30.0 305 260 31.0 303 246 254 26.7 29.3 


Your decision problem is: 

Has the modification made a significant improvement? 

There are many problems in which we are interested in whether the mean (or another 
parameter value) exceeds a given number, is less than a given number, falls into a certain 
interval, etc. 

Instead of estimating exactly the value of the mean (or another parameter), we thus want to 
decide whether a statement concerning the mean (or other parameter value) is true or false, 
i.e. we want to test a hypothesis Hp about the mean (or another value). 


In example 3 the hypothesis Hp could be 
“No significant change has occurred in the plant conditions" 





In example 4 the hypothesis Hp could be 
“No significant improvement has been made by the modification” 


We will solve the two given decision problems later in section 5.2 (test procedures). 


Now we will consider another specific example introducing at the same time the important 
parts of all similar decision problems. 
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Example 5: 

In the manufacture of safety razor blades the width is obviously important. Some variation in 
dimension must be expected due to a large number of s mall causes affecting the 
production process. But even so the average width should meet a certain specification. 
Suppose that the production process for a particular brand of razor blades has been geared 
to produce a mean width of 0.700 inches. Production has been underway for some time 
since the cutting and honing machines were set for the last time, and the production 
manager wishes to know whether the mean width is still 0.700 inches, as intended. 


We call set of all blades coming from the production line in a certain time interval (t, t+h) the 
statistical population to be studied. For example t = January 5th, 0° and t+h = January 6th, 
0”, if the statistical population we are interested in is the set of all blades produced on 
January 5th. 

If the production process was initially set up on that day to give a mean width of 0.700 
inches, we can say that the hypothesis Hy "the produced mean width of the regarded 
population is 0.700" should be tested. In symbols this is: tp = 0.700 = hypothesized mean. 


Accepting the Hypothesis Ho: 

Suppose we draw a simple random sample of 100 blades from the production line. We 
measure each of these carefully and find the mean width of the sample to be 0.7005 inches. 
The standard deviation in the sample turns out to be 0.010 inches. That is, 


n= 100 
X = 0.7005 inches 
s = 0.010 inches 


For the hypothesis Ho = 0.700 to be true, the sample mean X = 0.7005 inches would have 
to be drawn from the sampling distribution of all possible sample means whose overall mean 
is 0.700 inches. 

Now the important question arises: If the true mean of the population really were 0.700 
inches, how likely is it that we would draw a random sample of 100 blades and find their 
mean width to be as far away as 0.7005 inches or farther? In other words, what is the 
probability that a value could differ by 0.0005 inches or more from the population mean by 
chance alone? 


If this is a high probability, we can accept the hypothesis that true mean is 0.700 inches, 
because it is very easy to get it (high probability). 

If the probability is low, however, the truth of the hypothesis becomes questionable because 
the sample we got is in reality very seldom. 


To get at this question, compute the standard error of the mean from the sample: 


S_= 0.010 = 0.001inches 


* In i060 


S. 
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Since the difference between the hypothetical mean and the observed sample mean is 
0.0005 inches and the standard error of the mean is 0.001 inches, the difference equals 0.5 
standard errors. By consulting Table A-1 (Z) = 0.5) we find that the area within this interval 
around the mean of a normal curve is 38%, so that 100 - 38 = 62 % of the total area falls 
outside this interval (cf. dashed lines below).If 0.700 inches were the true mean, therefore, 
we should nevertheless expect to find that about 62% of all such possible means would, by 
chance alone, fall as far away as 0.5 s, or farther. 


Therefore, the probability is 62% that our particular sample mean could fall this far away. 
This is a substantial reason to accept the hypothesis and attribute to mere chance the 
appearance of a 0.7005 inches mean in a single random sample of 100 blades. 


Naturally we reject in the same time the contrary of Ho namely that: 


My if # 0.700 





098 -699 -700 -70) ~702 
__- 
Ho* 055g 
—_ 
fy £ Sng 


in case Hy can not be rejected, avoid saying “it is proved that Ho is correct”. or "there are no 
differences in the means”, say "there is no evidence that HO is not true" or "there is no 
evidence Hy should be rejected”. 


Rejecting the Hypothesis Ho: 
Later, after production has gone on for some time, the query again arises: 


Is it reasonable to believe that the true mean width of blades produced remains 0.700 
inches? Since the process was adjusted to yield that figure the hypothesis still seems 
reasonable. We could then test it by taking another random sample of 100 blades. 


This time the standard deviation is still 0.010 inches, so the standard error of the mean is 
still 0.001 inches, but the mean is now 0.703 inches: 


In order to test the hypothesis that the true mean of the population is 0.700 inches, we again 
go through the same line of reasoning. If the true population mean really were 0.700 inches, 
how likely is it that we should draw a random sample of 100 blades and find their sample 
mean to be as far away as 0.703 inches? 


Since the difference between the hypothetical mean of 0.700 inches and the actual sample 
mean of 0.703 inches is 0.003 inches, and the standard error of the mean is 0.001 inches, 
the difference is equal to three standard errors of the mean (i.e. 0.003/0.001 = 3). 


“2p =0.5 p = 0.69 = 69%. Therefore 31% fall outside to the right. The same portional fall outside to the left of 
“Zp 


Page 468 © Holderbank Management & Consulting, 2000 


. 
“Holderbank” Cement Seminar 2000 HOLDERBANK: 


Materials Technology II - Statistics 


Now if 0.700 inches really were the population mean, we know from Table A-1, Appendix III 
that 99.7% of all possible sample means, for random samples of 100, would fall within three 
standard errors around 0.700 inches. Hence, the probability is only 0.3% that we would get 

a sample mean falling as far away as ours does. 

We have two choices: 


1) We may continue to accept the hypothesis (i.e. leave the production process alone), and 
attribute the deviation of the sample mean to chance. 


2) We may reject the hypothesis as being inconsistent with the evidence found in the 
sample (hence, correct the production process). 


Either of two things is true and we have to make a decision between them: 


1) the hypothesis is correct, and an exceedingly unlikely event has occurred by chance 
alone (one which would be expected to happen only 3 out of 1000 times); or 


2) the hypothesis is wrong 
Type | and Type li Errors 
Understandably, the question can be raised: What critical value should we select for the 


probability of getting the observed difference (x - 19) by chance, above which we should 
accept the hypothesis HO and below which we should reject it? This value is called the 


critical probability or level of significance. 


The answer to this question is not simple, but to explore it will throw further light on the 
nature and logic of statistical decision making. Let’s study the following example: 


in HO is true HO is false 


In reality Hy is true Ho is false 
We decide 


Accept Ho right decision error Il 
accept a true accept a 
hypothesis false 


hypothesis 


Reject Ho error | error Ii 
reject a true accept a 
hypothesis false 

hypothesis 





Example: 

Ho: the parliament building burns 

Decision maker is the commander of the fire brigade. 
Another expression for error | is: error of first kind 
Another expression for error II is: error of second kind 
lf we ask here, what is worse 

@ error! or error Il, 


then error | naturally costs a lot of money because the fire will destroy the whole parliament 
building. Error || only moves the fire-brigade. 


In a long run of cases which the hypothesis is in fact true (although we do not know it is true, 
for otherwise there would be no need to test it), we will necessarily either be wrong as in 1 or 
right as in 3. 
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That is to say, if we make an error it will be of Type I. 


Suppose we should adopt 5% as the critical probability. accepting the hypothesis when the 
probability of getting the observed difference by chance exceeds 5% and rejecting the 


hypothesis when this probability proves to be less than 5%. This amounts to the decision to 
accept the hypothesis when the discrepancy of the sample mean is less than 1.96 standard 
deviations, and to reject the hypothesis when the discrepancy is more than 1.96 standard 
deviations. 


If we take 1% instead of 5%, as above, we will get as limit 2.58 standard deviations. 


In fact, the percentage of cases in which we would expect to make an error of the first kind 
is precisely equal to the critical probability adopted. 


(The probability of error | will be abbreviated quite often by a). 
Just significant probability level: 


In many studies the critical probability is used to describe the statistical significance of a 
sample result. For example, an economist collects some data on, say, interest rates and the 
demand for money. He hypothesizes some relationship and wishes to see if the data 
support his thesis. He tests the hypothesis to rule out the alternative that the observed 
relationship occurred by pure chance. He reports his sample results as "significant at the 1 
percent level". Such a statement is a report to the reader that has the following meaning: 


1) if we were to set up a statistical hypothesis 
2) if we were to test this hypothesis using a critical probability of 1% 
3) then we would reject the hypothesis and rule out a chance relationship 


Significance levels of 10%, 5%, 1%, 0.1% are often used in reporting sample data. The 
smallest of these probability values is chosen at which the hypothesis can be rejected. 


So we see now what is basic for every statistical test: 
1) we need a clear hypothesis Ho 
2) we have to know by which statistic we want to test Ho (in our example it was x) 


3) we need a criterion C for decision making in the following form: 

reject Ho if C applies 

accept Ho if C does not apply 
The criterion C is usually given in form of a critical limit, called significance limit, which 
should not be exceeded by the calculated test statistic. 
To perform such a statistical test, it is necessary to select the risk to commit a type | error, 
i.e. the significance level a (see section 5.2, p. 28). 


Often a relevant difference has to be detected with a certain probability, i.e. with a 
predetermined type I! error B. This is only possible with a certain sample size n, as is further 
outlined in section 5.3. 
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If we reject a hypothesis when it should be accepted, we say that a Type | error has been 
made. If, on the other hand, we accept a hypothesis when it should be rejected, we say that 
a Type Il error has been made. In either case a wrong decision or error in judgement has 
occurred. 


In order for any tests of hypotheses or rules of decision to be good, they must be designed 
so as to minimize errors of decision. This is not a simple matter since, for a given sample 
size, an attempt to decrease one type of error increases the other type. In practice one type 
of error may be more serious than the other, and so a compromise should be reached in 
favour of a limitation of the more serious error. The only way to reduce both types of error is 
to increase the sample size, which may or may not be possible. 


Note that from the philosophy of testing there is always the possibility to reject Hp although 
Ho effectively is true (the probability for this is a, the type | error probability). So if you are 
testing say 100 “correct” datasets (i.e. for which Hp holds) to a significance level a = 5% you 
will expect about five results that reject Ho, although it holds. This has to be considered if a 
lot of (Statistical) tests are made on the same data material (see e.g. Multiple Testing ....... 
.”Simultaneous Statistical Inference”, Miller, 1981). 


One-sided and two-sided tests 


in example 5 we are interested in values of significantly higher or smaller than po. Any such 
test which takes account of departures from the null hypothesis Hp in both directions is 


called a two-sided test (or two-tailed test H, yu = W),A: Ml # My) 


However, other situations exist in which departures form Hp in only one direction is of 
interest. In example 4 we are interested only in an improvement of production rate due to 
the modification of the cement mill and so a one-sided test is appropriate. It is important that 
the decision maker should decide if a one-tailed or two-tailed test is required before the 
observations are taken. 
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5.2 Test Procedures 

Performance of a statistical test requires in advance the recognition and formulation of the 
present decision problem (test situation). Use the following checklist for applications of 
statistical tests: 


CHECKLIST 


1) Is the decision problem concerned with the mean. median. standard deviation. 
correlation coefficient or other? 
Formulate a clear hypothesis Ho. 


2) Is ita one-sample or a two-sample problem? 
One-sample problem: 
Mean, median or standard deviation of a given sample of size n is compared with a 
hypothetical value (example 5). 
Two-sample problem: * 
The decision problem is concerned with differences between two given samples. 


3) For the two-sample problem: Are the observations independent or paired? Observations 
are paired if every value in the first sample can be attached definitely to a value in the 
other sample. Paired comparisons are in general considerably more efficient than a 
comparison of independent samples. 


4) Is the sample drawn from a normal distribution? 


5) Decide between one-sided or two-sided test. 
One-sided tests are used if differences only in one direction may occur or are of interest. 
Two-sided tests are used if no preliminary information is available, in what direction the 
sample may differ. 

6) Choose the test procedure in the table “TEST CONCERNED WITH” (sample sizes n<50 
are considered to be small). 


7) Choose the significance level a. Usual choices are a = 5% or a = 1% (error of type 1). 
8) Calculate the test statistic T of the chosen test (cf. the formulae of the following pages). 


9) Look for the significance limit T, for the test statistic in the corresponding table with p = 
1-a/2 (two-sided test) or p = 1-a (one-sided test). 

10) Decision: If the calculated test statistic exceeds the significance limit, reject the 
hypothesis Hp (the observed difference is significant at the level «). Otherwise there is 
no reason to reject the hypothesis Hp and the difference is considered to be not 
significant. 
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TEST CONCERNED WITH 


test situation mean general | normal mean 
distribution standard 
deviation 


One-sample problem 
small n (n<50) sign test signed-rank-test x’-test 


large n (n>50) sign test z-test x*-test 
Two-sample problem 


independent 
small n median test Wilcoxon-test F-test 


large n median test z-test F-test 
paired 

small n sign test signed-rank-test 

large n sign test z-test 


More than two Krukskal- Wallis- | Analysis of Bartlett-iest 
samples test variance (ANOVA) 
Friedmann-test 





These tests are not treated in this 


Selected test statistics 


5.2.1 z-Test 
Used to test means with large sample size n. 


a) One-sample-problem: 
Given is a sample of size n with arithmetic mean xX and standard deviation s, we test the 


hypothesis that the mean (estimated by xX ) is equal to a given or target value Up 
Hypothesis Hp [=o 


a x- n 
Test statistic z= (X= Wn 
oO 


In general o is not known. It can be replaced by s for large sample sizes. 
b) Two independent samples: 
Given are two samples of size n, and np with arithmetic means X,,xX, and standard 


deviations s;, So. 
Hypothesis Hp = Hy = He 
x, ~~ Xp 
2 2 
& Ss 
1 + ~2. 
n nm 


Test statistic z= 


c) Paired comparison 
Given is a sample of n paired observations x,, y;. Calculate the arithmetic mean d ofall 
the differences d, = y, - X,;and the standard deviation sg. 
Hypothesis Hp x = Hy respectively y= 0 


dvn 
Sy 





Test statistic z= 
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Significance limits 

Often used significance limits for Z,.. (one-sided test) and Z1..2 (two-sided test) are 
Zo95 = 1.645 20.975 = 1.960 
Zo 99 = 2.326 Zo 995 = 2.576 

For other significance levels a look for Z;. in Table A-1. 

Decision 

The difference is significant if 


IA>Z,4/2 two sided test 


AZ. 


one-sided test 
ec 


Example 


a) Inthe example 1 we test the two-sided hypothesis Ho: 
U = 35.0 N/mm’ = ip (one-sample problem). 


X = 33.2 

s=5.6 

n=90 

a= 0.05 

ze (33.2 —-35.0)v90 ~_305 


5.6 
Izl = 3.05' 1.96 = 20.975 
Decision: The sample mean differs significantly from the standard strength 35.0 N/mm?. 


b) in example 3 (chapter 5.1) we test the hypothesis whether a change in plant conditions 
has occurred or not (two independent samples): 





Ho: Hi = He 
n, = 618, No = 525 
xX; = 82.67, Xo = 85.58 
S$; = 4.13, Se = 3.79 
a = 0.05 (two-sided test) 
85.58-82.67 
_ 85.58 -82.67 
(4.13)? + (3.79)? 
618 525 


Z=12.41>1.96 = 297; 
Decision: Highly significant difference between the two samples of titration values. 
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5.2.2 Sign Test 


Test for the median in one-sample problems and paired comparisons. 


a) One-sample problem: Given is a sample of size n. 
We test the hypothesis that the median of the population is equal to a given value mo 


Ho: median = Mo 
Test statistic: 
Count the number of observations smaller than my and those larger than mo. The test 
statistic k is then the smaller of the two numbers. 
b) Paired comparison: Given are n paired observations x, y;. 
We test the hypothesis that x; and y; have the same median. 
Test statistic: 
For each pair of observations xi, y; record the sign of the difference y; - x; . 
The test statistic k is then the number of occurrences of the less frequent sign. 


Significance limits 
n —_ 1 24 0/2 . 
Calculate k,,. = > ~~ n—1 (two-sided test). 


For the one-sided test replace o/2 by a. Z, is the significance limit of the z-test 
295 = 1-645 25975 = 1-960 


Decision 
If k is less than kay2 (resp. k,) conclude that the medians are different, otherwise, there is no 
reason to believe that the medians differ. 44 


Example 
Test the hypothesis that 50% of the population has an income of more than 3000 (example 
2). 


Ho: Mo = 3000 
k=217 
n= 479 


a = 0.05 two-sided test 
kyo06 = = _! = => J478 = 217.6 


The test is just significant at the 5% level. Because the sample median is 2700, we conclude 
that less than 50% of the population has an income of 3000. 
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5.2.3 Signed-rank test 

Test for the mean in symmetrical distributions with small sample sizes n (one-sample or 

paired comparison). 

a) One-sample problem: Given is a sample of size n from a symmetrical distribution. 
Hypothesis Hp : U = Ho 


Disregarding signs, rank the d; according to their numerical value, i.e., assign rank 1 to the 
smallest observation, assign the rank of 2 to the d, which is next smallest, etc. In case of 
ties, assign the average of the ranks which would have been assigned if the d's had differed 
only slightly. (if more than 20% of the observations are involved in ties, this procedure 
should not be used). 


To the assigned ranks 1, 2, 3, etc., prefix a + or a - sign, according to whether the 
corresponding d; is positive or negative. The test statistic T is then the sum of these signed 
ranks. 


a) Paired comparisons: Given are n paired observations X,, yj. 
The hypothesis is tested that both have the same mean. 
Test statistic: Compute d; = y; - x; for each pair of observation and continue as in a). 


Significance limits 
Look for Togs or To.975 in Table A-7. If the number m of differences d; exceeds 20 perform a 
z-test with 


_ T 


41 = [m(m+1)(2m+1) 
6 


Decision 

Conclude that the means differ if 
I1= Tare (OMzr|2 Zare) two-sided test 
T2T_. (orz, 2Z,_,) one-sided test 


T<-T,  (orz,<-z_,) one-sided test 
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5.2.4 Wilcoxon-Test 


Test for a comparison of means in two independent samples: 


Given are two independent samples of size n; and ne. 
We test the hypothesis that both samples have the same mean 
Ho-1 = 2 


Test statistic: 

Combine the observations from the two samples, and rank them in order of increasing size 
from smallest to largest. Assign the rank of 1 to the lowest, a rank of 2 to the next lowest, 
etc. (Use algebraic size, i.e., the lowest rank is assigned to the largest negative number, if 
there are negative numbers). In case of ties, assign to each the average of the ranks which 
would have been assigned if the tied observations had differed only slightly. (If more than 
20% of the observations are involved in ties, this procedure should not be used). 


Let n,; = smaller sample 
Np = larger sample 
Nn=Nny;+Ne 


Compute R, the sum of the ranks for the smaller sample. (If the two samples are equal in 
size, use the sum of the ranks for either sample). 


Compute W = 2R - n,(n + 1) 


Significance limits 
Look for Wog9s5 or Wo97s in Table A-8. For sample sizes not mentioned in the table perform a 
z-test with 


_ WwW 
Decision 


Z, = —_ 
" [n.n,(n+ 1) 
3 
Conclude that the means differ if 


|W) 2W,212 (0r|Z,|2 2-9/2) two-sided 


WeW_, (orz,22Z,,) one-sided 


W<-W,, (orz,<-Z_,) one-sided 
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Example 


In example 4 (chapter 5.1) we are interested in an improvement of production rate after a 
modification of the cement mill. 


a = 0.05 (one-sided test) 
Combined sample (values in italics correspond to the sample after modification): 


tons/hr 246 24.6 25.0 254 260 26.3 26.7 267 27.2 27.8 


rank 1.5 15 3 4 9 10 


o [NO 
D 
N 
on 
wl 
on 


tons/hr 28.0 28.3 29.3 29.3 29.9 30.0 30.0 30.3 305 31.0 
rank 11 12 13 14 15 16.5 165 18 19 20 


The sum R of ranks in the second sample is 
R=1.5+44+5+7.5+114+134+ 16.5+ 18+ 19+ 20=115.5 
W=2.115.5-10.21=21 

WwW =21< 46 = Woos 

Decision 


The improvement is not significant. We have no reason to assume a real improvement due 
to the modification. 


5.2.5 t-Test 


This test is used for comparisons of means in normal distribution with small sample sizes n. 
If the distribution is not known to be normal, prefer rank tests (Wilcoxon, signed ranks). 





The test situation is the same as in 5.2.1 (z-test) refer for remarks on hypothesis and 
decision. Instead of z;.. look for significance limits t;.. in Table A-4, with n-1 degrees of 
freedom (df) in the one-sample case or for paired comparison, and with n1 + n2 - 2 degrees 
of freedom (df) for the two-sample case respectively. 


One-sample problem: Given is a sample of size n drawn from a normal distribution. 
» (X-Men 
s 


Two independent samples: Given are two independent samples of size n; and n2 from two 
normal distributions with means W;, H2 and equal standard deviation o. 


t= X, — X, | nn, 
(n,—1)s? +(n, —1)s5 ¥ +, 
(1, +N, —2) 


Paired comparison: Given are n pairs of observations xj, y,, when x and y follow a normal 
distribution. 


t= Jn 
Sy 
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5.2.6 Median-Tests 


Median tests for two independent samples are not given here in an explicit form. For small 
sample sizes n Fisher's test for 2 x 2 contingency tables and for large n x’-test for 
contingency tables may be used (Natrella 1963, Noether 1971). 

In a comparison of two independent samples we are often not interested in differences only 
of means or medians, but generally in location differences of the two samples. 

In this case use the tests for means, which are more efficient than median tests (Wilcoxon- 
test or z-test). 


5.2.7 X?-Test 





X’-tests are also used for 
¢ Goodness of fit test 

@ Independence test 

¢ Loglinear models 

(see e.g. Haberman, 1978) 


Given is a sample of size n drawn from a normal distribution with mean yw and standard 
deviation o. We test the hypothesis that the standard deviation o, estimated by s from the 
sample is equal to a hypothetical standard deviation oo. 
«gs n—1)s? 
Ho: 6 = po Test statistic: x? = (n=)s" 
0 


Significance limits 
Look for significance limit X?om, in Table A-6 with m = n-1 degrees of freedom (df). 


Decision 
If X?>X* vem OFX’ < X*vom conclude: c=, (two-sided) 
eX? > X71 om conclude: o>6) (one-sided) 
xX? < Xm conclude:o <6) (one-sided) 


Otherwise we have no reason to believe that o differs from op. 


5.2.8 F-Test 


Comparison of the standard deviation in two independent samples. 


Given are two independent samples of size n, and n2 with standard deviations s; and So. 
Both samples are drawn from a normal distribution. We test the hypothesis that both 
populations have the same standard deviation. 


Ho: 6; = Go 
Test statistic: 
Let be s; > Sp, then compute 
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Significance limits 
Look for F4-cemim2 (one-sided) OF F1-a/2m1,m2 (two-sided) in Table A-9 with m, = n,-1 and m2 = 
nz-1 degrees of freedom. 


Decision 
Conclude 01 # Oo if F > Fyqomime (two-sided) 
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5.3 Sample Size Determination 

As stated at the end of chapter 5.1, the probability B of making a type II error in a given test 
with significance level « (type | error) depends upon the sample size n. 

How can we determine the sample size that is necessary to hold the type II error within 
certain boundaries (probability B)? Or in other words: What sample size n is necessary to 
detect a relevant difference with great probability (1-8)? 

- Let us first examine a one-sided one-sample-test for testing a mean: 

Suppose we are interested in the mean p of a random variable X. We want to test the 
hypothesis Ho: U = Ho against the alternative hypothesis A: py > Uo with a z-test. The error- 
type-one shall be a. 

The probability to detect a deviation of A from [ip shall be at least (1 - B), if this deviation 
exceeds a preselected relevant difference 6 (B is now an upper bound for the type II error). 
The following figure shows the distribution of the sample mean xX under Hp and under the 
(special) alternative hypothesis A: = Uy + 8: 





Interpretation of the graph: 


@ C., is the criteria (significance limit) of the test: 
The hypothesis Hg is rejected with probability « even if it is true. 


¢ On the other hand, the probability to accept Ho: u = Uo, even if the true mean is p = ty + 
5 (type II error), is greater than B. 


¢ The relevant difference 5 may only be detected with probability (1-B), if Cu. < Ca; in this 
case the risk to commit a type I! error is < B. 


2 
. o sags at . 
The variance —— of the two distributions becomes smaller with increasing n and hence Ca 
n 


moves to the right and C,,, to the left. Therefore, we can find an n such that Cy, < Ca: 
C,, $C, 


r+ o= + 6 (1) 
Za" rz HaSZy Fa Hy + = Ze Hot 
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Sle +2, 4)S Jn 


2 2) 
o ( 
orn2 gr (Zr2 +Z5)° 
n corresponds with the required sample size to detect a desired difference 6 with probability 
(1-8) by a z-test with significance level a. 
For the two-sided one-sample-test we only have to replace a by o/2, and we get: 
2 3) 
o ( 
2 ge (Free +Z.,)° 


Note: B is not replaced by 8/2! 


In a similar (but somewhat more complicated) way we find lower bounds for the sample 
sizes n, and nz satisfying our conditions in the one-sided two-sample-test: 


2 2 4) 
O° 0,+05 2 ( 
1) 2 2 (2, + Za) 
O 
> O 
n,2—n, 
O02 


For equal standard-deviations (o; = 62 = 0) we obtain from (4): 

20° (5) 
n=n2 st (A. + Za) 

In the two-sided two-sample-test we simply have to replace a by o/2 in (4) or (5). 


The following table shows short rules for determining sample sizes when a = B = 5% and oj. 
= 62 = 6 in the two-sample-case. The numbers in parentheses refer to the formulae from 
which the ee are derived. 


| | one sample | two samples 


: 7 
two-sided 130? 
n2—3 
o 
3 


with 5 = relevant difference to be detected with probability (1-B) 
a=B=5% 
Note: 


¢ The variance o” is usually not known, but often some knowledge about o” is available 
from former experiments (standard deviation). Otherwise o” may be estimated in a pilot 
study. 


¢ Inthe case of small sample sizes (t-test, Wilcoxon-test), add 5% to the calculated n. 
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6. DATA PRESENTATION AND INTERPRETATION 
(Chambers, Cleveland, Kleiner and Mikey, 1983). 


The problem of data presentation and interpretation is common to cement manufacturers 
and users. In every stage of cement, aggregate, concrete production and application, data 
concerning materials, equipment, energy consumption, market situation, costs, etc. are 
generated. All this information must increase the knowledge about what happens in the 
process and market, thus providing the basis of decision. Moreover, communication 
between supplier and consumer has to rely on this information. We consider it, therefore, 
vital not only for optimum manufacture of cement and cement based products, but also with 
respect to the mutual relation between manufacturer and user that appropriate attention is 
given to data handling, i.e. presentation and evaluation. 


We may differentiate between three levels of information: 

@ data needed for direct process and quality control (routine decision, off- or on-line} 
¢ data required for day-to-day management decisions based on quality reports 

¢ additional data allowing long-term improvements and developments. 


With respect to presentation and interpretation of test results, the main problems for piant 
management are: 


@ how to organize the required information flow so that decision can be made at all levels 
of competence 


@ to reproduce and evaluate the corresponding data in a specific situation. 


The relevant data of ali domains cannot be made available without a functional information 
system. But availability alone does not necessarily result in a rational decision based on the 
data. To achieve this, two further aspects must be considered: first, the data should be 
presenied in an intelligible form, i.e. a high transparency of the results enables the 
management to recognize certain relationships or critical results in time. Secondly, the 
decision maker must be aware of accuracy, significance and relevance of the considered 
data in order to provide a realistic interpretation. 


Availability of data 


A quick availability of data does not necessarily require a fully integrated data based system 
with electronic data processing, but is rather a matter of organization. Important data (e.g. 
for process and quality control) should circulate with little loss of time. Consequently, an 
appropriate reporting system (organization) must be established. Additional data should not 
disappear in some drawer where a later retrieval is impossible or at least very inconvenient. 
To avoid such a disorder, all data are recorded in a similar way, including a note on the 
circumstances of measurement and provenance of samples and test results. Remarks 
about circumstances and provenance are used to judge the comparability of different sets of 
observations. A standardised recording procedure makes data surveying easy and simplifies 
a later change to electronic data processing. 
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6.1 Intelligible Presentation 


The main aim of data presentation is transparency rather than secrecy, i.e. graphs should 
be employed instead of tables. A graphical presentation gives a quick survey on relevant 
information such as changes in time, extreme values, relationship between variables. A 
short survey on frequently used graphs can be found in section 6.4. It is recommended to 
produce the graphs directly at the source of the data in the form of tallies and/or control 
charts. A control chart may already be included in the laboratory journal and reports, directly 
behind the columns for sample identification and test results. The possibility of obtaining a 
quick survey by consulting a well presented graph will not only help the management in its 
decision making, but transparency will also improve interest and motivation of the personnel 
at any level of competence. 


6.2 Interpretation 
First of all. we must know how much we can rely on.the data. 


Test results are never "true" values, but are rather subject to errors of three types (see also 
section 1): 


¢ random errors caused by sampling, imprecise measurement, environmental effects, etc. 


# systematic errors caused by bias in sampling or process measurement, by the use of 
inadequate experimental design 


# gross errors caused by recording wrong or not comparable values. 


Usually, we expect 5 to 10% gross errors in a set of observations. Random errors may be 
denoted as reproducibility and expressed as the corresponding standard deviation. In 
general, these errors are underestimated. Reproducibility should be known for every 
important analyzing method. Systematic and gross errors are difficult to characterize. They 
should be minimized by careful experimentation and data handling. A periodic check for 
systematic errors should be done (calibration, comparison with a standard, inter-laboratory 
test). 


How to compare several data sets or groups of data? 


Before any comparison is made, we have to seriously check the comparability of data. 
Often, data sets differ in provenance of samples or circumstance of measurement, so that a 
comparison ma y be impossible. Even the fact that sample 1 was measured by laboratory 
assistant Miller and sample 2 by Brown will lead to a biased comparison if there is any 
relevant systematic error between the two persons. If the data sets are comparable and we 
observe a certain difference, we have to ask the following questions before taking any 
action: 


a) ls the difference significant? 
b) Is the difference relevant? 


The problem of significance is answered by a statistical test. If it is not significant, we have 
no reason to take any action because the observed difference may occur by chance alone. 
lf the test indicates a significant difference, it is not necessarily relevant for the problem we 
are concerned with. Of course, the decision whether or not it is relevant is not a statistical 
problem. Perhaps a decision maker is alarmed when an observed difference, considered to 
be relevant in the present problem, does not jead to a significant test result. In this case, the 
sample size used may be too small or the testing procedures may not be sufficiently 
sensitive to solve the given problem. 
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Consequently, experience in product manufacture and application is necessary to assess 
the relevance of difference between target quality and experimental values. However, to 
judge whether it is significant requires a training in statistical technique, particularly test 
procedures. Decisions based simply on either practical experience of many years 
(relevance) or on statistical technique and procedures (significance) will lead to too frequent 
and unnecessary actions in both the manufacturing process and product application. 


6.3 Data Interpretation related to Problems in Cement Application 


If confronted with the problem of finding the reasons for poor quality of cement related 
products (concrete, asbestos cement, etc.), the cement is often suspected to be the cause. 
This can be explained by the fact that cement is the binding agent, and in the majority of 
cases constitutes the most expensive component, although it is known that the effect of 
other components, proportioning and curing conditions etc. are of great importance. A 
further reason to inspect the cement first is the difficulty to specify the other effects while 
cement is well defined by its chemical and mineralogical composition and its fineness. 


To find out the causes for inferiority or changes in quality, a collaboration of consumer, 
producer and statistician is indispensable: the statistician may be omitted in uncomplicated 
problems if delegates of consumer or producer are well trained in statistics. In a first 
retrospective analysis of available data, parallel changes of parameters and quality are 
studied. This is done by drawing scatter diagrams and performing a regression analysis. 
The disadvantage of retrospective studies is the difficulty to find out causal relationships. 
The effects of several variables are mixed and cannot be separated due to their causal 
origin. On the other hand, an observed correlation indicates a possible causal effect. The 
decision about what cause is really responsible must be made by an experienced specialist 
and not by a statistical test. Often a decision is not possible because dependencies are too 
complex. In this case, a special experiment has to be planned and performed with a 
controlied variation of suspected variables and careful elimination of interfering effects 
(prospective analysis). In contrast to the retrospective analysis, an adequately designed 
experiment renders it possible to evaluate and judge causal effects with statistical methods. 
A short survey on the use and interpretation in regression/correlation analysis and designed 
experiments is given in chapter 7. 
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Survey on Graphical Data Representation 
Representation of one sample - documentation 
Frequency table / Taliy 


Given is a sample of large size n. Observations are grouped into classes of equal length 
and marked in the tally. 


Example: 90 values of concrete strength 


tally absolute relative 
frequency frequency 










ras fees mrlew | _s | oo 
[3.0 fens anspew [ie [oan 
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Histogram 
Graphical representation of the tally. Visualization of minimum, maximum, center and shape 
of the distribution. 


Area of rectangles corresponds to absolute or relative frequencies in the classes. 


15 


10 
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Frequency curve 


Relative (or absolute) frequencies plotted against class mid point. Every point on the curve 
corresponds to the relative (or absolute) frequency of observations falling in this class. 


20 

15 

10 

5 

0 Zi 

20,0 30,0 40,0 50,0 N/mm? 


Cumulative frequency curve 


Cumulated relative frequencies plotted against upper class boundaries. Every point on the 
curve corresponds to the portion of values which are smaller than any given strength x. 


Example: Half of the measurements are smaller (respectively greater) than 33.5 N/mm? 
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Stem-and-leaf Diagram 
Similar to the histogram, but allows to see the individual data. Each observation is a leaf of a 
stem. 


zO0 9 

mL 

zz 8 

23 O579 
24 456 

z5 oO 

26 1 

2?’ 126899 
<8 455 
29H O39 


30 0122378 
31 456778 
32 0244888 
33M 033456 
34 124569 
35 2233455889 
3SGH 14685 
37 14689 


38 169 
39 247 
40 12 
41 o9 
42 
43 4 
44 3 
45 


38 
***OUTSIDE VALUES*** 
48 9 


Representation of small samples 


For small sample sizes n the individual values are plotted directly on the measurement 
scale. Suitable to detect outliers and skewed distribution. 


0,5 1,0 1,5 2,0 
K,O (%) 


Example: K2O-content of 10 cements 
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Time-plot 


Often it is appropriate to draw the observations in chronological order to show a possible 
change of the level in time. 


Useful in annual reports. 


N/mm2 
50 


40 


30 


20 





10 
o 15 30 45 60 75 90 NO. 


Cumulative time-plot 

Cumulated values are plotted against time. Used to show deviations from a cumulative 
target. 

Example: Actual clinker production is cumulated every month and compared with a target. 


ws 
6 
& 





tons of clinker 


actual production 


JFMAMJJASOND 
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Box-Plot (Box-and Whisker-Plot) 


Special and frequently used plot to numerize data. Shows median, lower and upper quartile 
(forming the box) and two lines (the whiskers) extending to the extremes of the data. If 
unusual extreme values occur, the whiskers extend only to those points that are within 1.5 
times the inter-quartile range. 


xX> 





L LQ 6M UQ 


Scatter-plot or x-y plot 
Piots two variables against each other. 





39 1.3 1,3 1,7 1,9 2,1 
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Draftmensplot 
Plots two or more variables against each other! 
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Flury-Riedevyl faces and Chemoftf faces 


Faces are a method to show multivariate data graphically. Each variable is mapped to one 
or more of the face parameters. Other methods to represent multivariate data are the star 
symbol plot, the sun ray plot, castles and trees. 
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6.4 Control Charts 


Control charts are special time-plots to show a possible change of the characteristics in a 
production process. The essence of a control chart is its clearness and intelligibility. In a 
quick survey it is possible to recognize natural subgroups, within which variation is likely to 
be random but among which assignable causes of one sort or another may cause non- 
random variation. 


Summarized statistical errors (such as means or standard deviations) and confidence limits 
determined from overall sample data can be very misleading, if the data are not free from 
the effects of assignable causes. In cement industry annual means and standard deviations 
are mostly misleading due to a nonstationarity of the process, i.e. the occurrence of 
systematic changes at the production level. (Example: mean and standard deviation of the 
lime saturation of raw meal and/or clinker in annual reports. Changes in the target value 
required by the process or the product lead to a high non-interpretable standard deviation.) 


lf possible, the data should originally be collected with this subgrouping in mind; in such 
cases, the analyses can be simplified by arranging for an equal number of observations in 
each subgroup. If the data must be analyzed as they come, subgrouping may still be 
possible with knowledge of the data’s source, i.e. obvious changes in the process conditions 
must be registered together with the data. 


Control limits 


To judge a significant change at the production level, the charts are completed with control 
limits. As a general rule it can be assumed that such a change is present, if a sample point 
falls outside of 3 o-limits, where o is the standard deviation of a homogeneous production 
phase. 

This engineering rule has been found to work well. No exact probability is given for a chance 
variation beyond 3 o-limits, but in general it is very s mall, in the order of perhaps 0.3%. 


It may be of advantage to use two limits, such as a 2 o-warning-limit and a 3 o-action-limit. 
In this case, an action should already take place when two subsequent values lie between 
the two limits. 
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The use of 3 o-limits bases on pure statistical considerations. It may occur that these limits 
are in conflict with the required tolerances of an external standard (e.g. process, standard or 
market requirements). If the required tolerances are smaller than the statistical ones, a 
modification of the process is necessary to improve its precision. In the opposite case, the 
statistical limits may be used in order to detect changes as early as possible. 

Control Charts 


Several types of charts may be used to detect different types of changes in the process: 


Control chart 


Cause of Change jMean | Range | Standard | Cumulative 
ee 


Gross error (blunder ly fas 2 
Shift in average 


[Shiftin variability Ss |- [a te 


Slowfluctuation (trend) [2 [- [- 
Rapid fluctuation cycle) |-1 [2 |- | 


- = not appropriate / 3 = least useful 










2 = useful / 1 = most useful 
X — chart 


Samples of fixed size n are taken from time to time. The arithmetic mean of the samples are 
plotted versus time. 


Control limits (for warning and/or action) can be computed with the help of special 
procedures. 


Mean Blaine 
(cm2/g) 


3700 


3000 


2900 


Date 
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R-chart of s-chart 

Given are small samples of fixed size n. The range or the standard deviation of the sample 
is plotted. 

Values exceeding a certain control limit indicate an increase of variability. 


Stand. deviation 
Blaine 


300 
200 


100 


Date 


Chart of individual values 
Instead of means, individual values ma y be plotted in the order of measurement. 


N/mm? 
~-———  —— upper limit —— — 
70 
66 v, v| NW 
62 
lower limit 
58 


Date 
CUSUNM-chart 


Cumulative-sum-chart. The deviations of individual values from a target value are 
cumulated. 
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This chart is especially sensitive to slow fluctuations in the process. 


N/mm2 


+ 40 


+ 20 


= 20 


Date 


6.5 Comparative representation 
Frequency polygon 


Comparison of the distribution of two samples. Relative frequencies corresponding to the 
histogram are plotted on a line graph against the mid-points of the classes. 


Example: Mortar strength in the first and the second half-year. 





52 S54 586 58 60 62 N/mm? 
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Cumulative Frequency plot 
Comparison of the distribution of two samples. Relative cumulative frequencies are plotted 
against upper class limit. 


* 
100 
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Grouped Box-Plots 


Shows two or more box-plots in the same graph. May be used to show the distribution of 
different variables or the distribution of the same variable in different groups (samples). The 
notched box plot may be used for the latter comparison. 


N/mm 


70 


60 


50 
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Time-plot 
Comparison of simultaneous changes of several variables in an observed period. 
Example: Silica- and alumina ratio of a cement type in a specific year. 


2,5 SR 
2.0 

1,5 | AR 
1,6 


JS FMAM J 3 AS ON OD 


Scatter-plot 
Illustration of the relationship between two variables. 


Example: Relation between strength and density of concrete. The relationship is not linear. 


Don’t use the correlation coefficient in this case, because it is only a measure for linear 
dependence. 


1,0 


Relative strength 





0,75 0,80 0,85 0,90 0,95 10 
Deegree of compactness 
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Comparison of smail samples 
Individual values are marked on the scale separately for each group of observations. 


Example: Inter-laboratory-test concerned with elite-content. 





68 70 72 74% 
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Asymmetrical Flury-Riedwyl Faces 

This is a special version of the faces. E.g. the variables before and after treatment, 
respectively, are mapped to the same face parameter on the left and right face side, 
respectively. Or the left side shows the specification values of the variables and the right side 
the actual measurements. 





7. CORRELATION AND REGRESSION 


7.1 Correlation coefficient 


The degree of linear dependence between two random variables X and Y can be expressed 
by the correlation coefficient ry. 





with s,, = Sx —X)(y;-Yy) the covariance of X and Y 
n- 


i= 
and s,, S, standard deviations of X and Y. 


For practical computations use 


1 1 
Sy =~) XV (XML) 
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The following figure shows the scatter diagrams for several degrees of dependence. 





No Correlation 


Pos.Linear Correlation 





Properties of ry: 


a) ly = ranges from -1 to +1 
fy = +1: All measured values lie on an increasing line 
-1 : All measured values lie on a decreasing line 
0 : No linear relationship between the measured values 


b) tx is a measure of linear dependence. If the scatter diagram indicates a non-linear 
relationship, then the correlation coefficient will be misleading and should not be 
calculated. 

C) fy is very sensitive (not robust) against outliers. 


The necessity of drawing a scatter diagram is illustrated in the following figures. Completely 
different graphs may result with equal correlation coefficients (r = 0.82). 





0 5 10 bt) 20 0 5 10 BS 20 
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ryy can be close to zero even though the variables are clearly non-linear dependent and r,, is 
not defined if s, or s, is zero. 


MN LUT 


r is not defined 


Interpretation of ry 


A high correlation coefficient between two variables does not necessarily indicate a causal 
dependence. There may be a third variable not under control which is causing the 
simultaneous change in the first two variables, and which produces a spuriously high 
correlation coefficient. In order to establish a causal relationship it is necessary to run a 
carefully controlled experiment (see chapter 8). Unfortunately it is often impossible to control 
all the variables which could possibly be relevant to a particular experiment, so that the 
experimenter should always be on the lookout for spurious correlation. 


The following is an example for confusion of correlation with causation: 


The following figure shows the population of Oldenburg at the end of each of 7 years plotted 
against the number of storks observed in the corresponding year. Although in this example 
few would be led to hypothesize that the increased number of storks caused the observed 
increase of population, investigators are sometimes guilty of this kind of mistake in other 
contexts. Correlation between two variables Y and X often occurs because they are both 
associated with a third factor W. In the stork example, since the human population Y and the 
number of storks X both increased with time W over this 7-year period, it is readily 
understandable that a correlation appears when they are plotted together as Y versus X. 
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A plot of the population of Oldenburg at the end of each year against 
the number of storks observed in that year. 


Fig.: 





population (in thousands) 
“I © 
3 ° 


oO 
o 


100 200 300 
number of storks 


By using sound principles of experimental design and, in particular, randomization, data can 
be generated that provide a more sound basis for deducing causality. 
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7.2 Linear Regression 


If the scatter diagram indicates that the two variables are linearly related, then we may want 
to predict the value of one of the variables from a given value of the other variable. For this 
purpose a regression line is fitted to the data (method of least squares). 


Model assumptions 
y,=Q+B +E, (1) 
where the residuals E's are independent normal random variables with mean 0 and constant 
variance 62. 
a = y -intercept 
B = slope 
xX = independent variable - Y dependent variable 


This assumption is essential if parameter reductions (see later), i.e. reduced models are 
tested and the assumptions have to be checked when a model is fitted. 


The estimates of the parameters a and B in model (1) are obtained by the method of Least 
Squares (MLS), i.e. the estimates are obtained by minimizing the sum of squared residuals 
n 9 nn. 2 
Dy; -@- bx = YE, 
jst jel 


with respect to a and B ! 


This method is very wide spread in statistics and the estimates have good statistical 
properties under the above model assumptions! Today there are also procedures which do 
not minimize the sum of squared residuals but use other criteria (Robust Methods!). 


In the following let 


Su = DK) “2 = LAP TDA) = (0-5, 
8, = DU -IP= Ly? (Ly, =(0- 08,7 


Soy = D(%)- HY -V)= DY) - LAV = (0 Sy 


The resulting estimated regression line is 


~ a B , § 
y=G+ 8,where B= 


2 |e 


a=V+ px 
Sy 





with a resulting minimal sum of squares (MSSQ) §,,,, = S,, — with n-2 degrees of 


xx 
freedom (2 parameters a and B are estimated). y is the predicted value for the dependent 
variable y. 
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Note: 


1) In the mentioned problem there are two regression lines, one to predict y from x and one 
to predict x from y. The two lines are not identical and therefore it is not allowed to invert 


the regression equation. 


2) The prediction equation is valid only in the observed range of observations. 
Extrapolations may give misleading results. We have no information that linearity holds 
outside of the present observations. 


For each pair (x;, y;) we can therefore compute the predicted value y of y by 


means of the regression function y = &+ {x i.e. 


a 


¥,=a+ px, 
and the estimated residual 
e=y-y, 
a = S._,,/(n—2) is an estimate for the variance of the residuals. The analysis of the 


residuals é; gives us the possibility to validate the model assumptions (Independence, 


Normality and constant Variability). The first check is done by plotting the residuals against 
the predicted y,. The residuals é, should be randomly scattered around the line e = 0 and 


show no pattern. 
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For example the following structures of the residuals would indicate, that the model is not 
correctly specified (e.g. that the variables x and y should be transformed before calculation 
of the regression line). 





Relation not linear Variance not constant 
(e.g. y = h (a + Bx) (e.g. variance o? proportional 
or y= _]1 to x) 

a + Bx 


The second check would be a normal probability plot of the residuals. 


If the residuals suggest that the assumptions do not hold, then further investigations are 
necessary (suitable transformations, other influencing variables, time-dependence => 
growth curves, time-series-analysis). 


After confirmation of the assumption and calculation of the regression line (sometimes also 
before calculation) it is possible that we want to investigate whether perhaps the y-intercept 
equals zero, or the slope equals zero or whether there is no relation between y and x (a= B 


= 0) or whether the o or B equal some specified values oy or Bo (e.g. if a value of B =1.1 
may be replaced by {, = 1.0). This means that we want to test if a reduced (simplified) 


model is sufficient to describe the relationship between y and x. For this it is essential that 
the model assumptions hold. 


7.2.1 Regression line with slope 0: 

Model: y; = a + E; 

Estimate &@ fora: & = y (= mean of the y;s) 
Minimal Sum of Squares (MSQ): Sin = Sy 
Degrees of freedom (df): df = n-1 


7.2.2 Regression line with y-intercept 0 
Model: y; = Bx; + E; 


2 a X.Y. 
Estimate £ for B: £ LXV 


dx; 
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=Dy?- Bear 


MSQ: S, 


min 


df = n-1 
7.2.3 Intercept = 0. Slope = 0 


Model: y; = E; 

no parameters to estimate 

MSQ=> y;) 

df=n 

Other models: 
y=) + px 
Y=a+ Bx 
Y=r=Qr Box 


The estimates and MSQ can be obtained by differentiation of the corresponding Sum of 
Squared Residuals! 


These special models are only useful if the data really speaks for them! 


7.2.4 Comparison of models 

This is done by performing an Analysis of Variance (ANOVA). ANOVA is a widely spread 
statistical technique. (Comparison of more than two means, Experimental Designs etc.). 
ANOVA compares the Minimal Sum of Squares of the models to each other. The basic 
results for the comparison are filled in a table, the ANOVA-Table. 





ANOVA-TABLE 
[Model _[MSQ_ |adf | 
Ho So nin df° 






| Reduction | S°nin- Snin_| dfo- of 
H, denotes the null hypothesis, A the alternative model. By H, the test-statistic 
p— (Simin - Spin)/ (of? — aff) 
Shin / Of 


is distributed according to a F-distribution with m, = (df° - df) and m2 = df degrees of 
freedom. 





Significance limits: Look up F, -ar( ca) af in Table A-9 with m, = (df°-df) and m. = df degrees 
of freedom. 


Decision: If F >F then conclude that the simplification to the null-model Ho is not 


1-a( f° ~df), af 
permitted, therefore the alternative model has to be used; otherwise (F<F,..) there is no 
evidence that the null-model should be rejected. 
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7.2.5 Standard deviation of the estimates 

The ANOVA of the null hypotheses H’p : y = a and Hy: y = Bx, resp. against the alternative 
A: y= + Bx allows us to compute the standard deviations (sd) of the estimated 
parameters & and £ in the alternative model A: y = a + Bx: let F(B = 0) be the calculated 
F-statistic for the test H’p against A (e.g. ANOVA for B = 0) and correspondingly F(a = 0) for 
H”> against A. 

Then 





q 
JF(a= 0) 
sd(f) _ wA_ 

F(B =0) 


so that the regression line of the alternative (full) model is 


sd(a) = 


ys (oatay cody 

In order to obtain the standard deviation in a model A: y = Bx, this model has to be 

compared with the null-model Ho : y = 0. 

For A: y=a, compare A to Hp: y=0 

For A: y = QotB,, compare A to Ho : y = % 

For A: y = a+Box, compare A to Ho : y = Box 

1-a confidence intervals for the estimated parameters can be obtained by calculating 
GE bt oj2,mSA(a) 


and 
B + t ei2m8a(B) 


where m equals the degrees of freedom in the alternative model and t;_a2m can be looked 
up in Table A-4. 
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7.2.6 Coefficient of determination 


The prediction of y with a regression line is more or less accurate dependent on the degree 
of linear dependence. How well does the regression line fit the data? A measure to express 
the relative accuracy of prediction compared with the total variation of y is the coefficient of 
determination r°. In the case of a regression line the coefficient of determination is the 
square of the correlation coefficient 

22 _ Sy _ Sy 


fr =f =o >= 
v% sis, S,Sy 


The total variation of y can be partitioned into two components. Total variation = explained 
variation + unexplained variation 


where 


total variation = x y,-y)* 
unexplained variation — x y;- yy (observed - predicted) 


explained variation = y'( y- yy? 


The coefficient of determination is the ratio 
2 _ &xplainedvariation 
totalvariation 


in this form r’ is defined also for non-linear curves and multiple regression. 
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7.2.7 Transformations before Regression Analysis 





Kind of transformation Type of relation Regression after 
transformation 
y'=Iny,x(y > 0) y=ae™ y'=In+ fx 
(exponential) 
y, X =|nx,(x > 0) y=a+ Binx y=a+ Bx 
(logarithmic) 
y=lny,x=Inx,(xX,y >0) y=ax y=|Ina+ Bx 
(exponentiation) 
y=y,x(y #0) y=(a+pxy'  _ W=at fx 
y, X= 1/x(x #0) y=a+ pa y=a+ px 
y= Vy, X= x(x #0) y=-— Y= a+ Bx 
ax+ B 
(hyperbolic) 
yelny,x=/x(y>0,x#0) y=ae’/* y=Ina+ Bx 
y'=Vy,x=e*(y #0) _ 1 V=a+ Bx 
a+ Be* 


Model Overview for Linear Regression 


Model Overview for Linear Repression 












B I iy,2+na [a - 2y] - Re mocccne 
Ea 
ewan wre [ems 
[ee fo Da 
| 


- 


(Exyyy - &ERy)? 












+P, a7 ° 


Syy * ala, - (y fr 
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7.2.8 Multiple and non-linear regression 
Often the variable to be predicted is not only dependent on one but on several independent 
variables. 


Model: 

Y= Bo + B,X,; + BoXpj trot BOX +E, 
where E; are independent normal random variables with mean 0 and constant variance o°. 
Again these assumptions have to be checked after a model has been fitted. 
In this case prediction can be improved in a multiple (or multivariate) regression expressed 
as 

Y= B+ BX, + Bx, + B5Xx, +--+ BX, 


Or the scatter diagram shows a non-linear relationship which demands to fit a non-linear 
curve to the data. Often a logarithmic or some other transformation of one or both variables 
may lead to a linear relationship, and a regression line can be fitted to the transformed data. 
In more complex situations further methods are available. 


For multiple, non-linear and mixed (multiple/non-linear) procedures consult literature (ref. 
Chattield 1975) 


Flury + Riedwyl (1988) give a very good praxis oriented introduction to multiple linear 
regression and multivariate analysis, including Discriminant Analysis, Principal Components, 
Identification and Specification analysis. 


Example 6 

The following data represent 
y: water requirement (%) and 
x: grain fraction 10-32 py (%) 


of n = 34 different cements. 





The scatter diagram shows a positive linear relationship between water requirement and 
grain size. Regression lines to predict x or y are clearly distinct. The lines are identical if ry 
is 1 or -1. 
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“HOLDERBANK” 


if r= 0 the lines are perpendicular, i.e. the best prediction of y is the arithmetical mean Y . in 


the same way X is the best prediction of x if no dependence exists. 


2 
= 26 
£ 
® 
E 
£ 
5 
cr 
@ 
&» 
be 
= 
5 24 
3 
22 


36 40 
grain fraction 10 — 32 p in % 


28 32 


Computations 
In a first step five auxiliary sums are provided: 


A= Xx = 1205.3 n=34 
B = =x; = 43'253.19 

C =2y; = 867.1 

D= sy, = 22'150.93 

E=2xy, = 30828.44 

Sy = a e-2 = 2.719545 





44 
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2 
2 - |e] = 15.918333 
n 


* n= 


s? - 1 |o-< = 1.131203 
X = A/n = 35.45 

y =C/n=25.05 

S,y = 89.7550 

Sx = 525.3050 

Sy = 37.3297 

Smin = 21.9973 

df = 32 

6”? =0.6874 

G =0.8291 


Correlation coefficient: r= 
VSS yy 


Regression line: B= 













[Model | Ms@Q__ 
33 


ast: a = 0 

Ho:y=Bx | 1781527 
A:y=a+Bx | 21.9973 32 
| Reduction _| 156.1553 | 1 


__ 156.1553 
21.9973/32 


F 95-1,32 = 4.17 (Table A-9) 


= 227.16 = F(a =0) 


— Simplification not permitted 


df 


Ho:y=a 37.3297 | 33 
A:y=a+B8x | 21.9973 | 32 


_ 15.3324 
21.9973/32 


Fo5,1,32 = 4.17 (Table A-9) 










= 22.304 = F(B =0) 
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-—> Simplified model not allowed ! 


We obtain the following standard deviations for the parameter estimates @ and B 








sé) = 4 - 19-45_ 1.2905 
JF(a=0) 227.16 — 
sd(f) = _ FA _ 0.171 _ 0.0362 


~ JF(B=0) 22.30 


and the regression line is: 
y =19.45+0.171x 6 =0.8291 
( ) (0.036) 


1.29 


95% confidence intervals for @ and B 


From table A-4: toz5,30 = 2.042 
&:19.45 + 2.042 * 1.29 = (16.82, 22.08) 
B:0.17142.042 * 0.036 = (0.097, 0.245) 


Coefficient of determination: 7 = 0.41 


Comment: 

Only 41% of the total variation of Y can be explained by the dependence of the grain 
fraction. The prediction can be improved when further variables are considered in a multiple 
regression. The coefficient of determination increases to 69% if C3A- and alkali-content is 
included in the equation 


y = 17.44 + 0.15x; + 0.26 Xp + 0.52xz 
X; = grain fraction 10-32u% 

X2 = C3A- content % 

X3 = K20 + Na2O% 
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The improvement is visible in a scatter diagram of observed against predicted values. 
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8. STATISTICAL INVESTIGATIONS AND THEIR DESIGN 


In practice numerous problems and questions cannot be answered unless special 
quantitative investigations are performed. This applies to every practical field, starting with 
the raw material exploitation and ending with the sale of the finished product. In many 
cases, it is not possible to attain complete information as an effective basis for decision 
making. For this reason, it is becoming more and more common to resort to statistical 
investigations in form of sample surveys and experiments. 


The proper performance and interpretation of statistical investigations is essential. The so- 
called “lies of statistics” generally have to be attributed to the wrong selection of data, a 
misinterpretation, or procedures which do not relate to the objective. Experience has shown 
that very rarely data are manipulated intentionally. However, people who employ “wrong 
statistics”, are convinced, due to a lack of in-depth knowledge, that their argumentation is 
objective and correct. 


8.1 The Five Phases of a Statistical Investigation 


Despite the variety of potential applications, a statistical investigation can roughly be divided 
into five phases in the following order: 





1) Formulation of problem: 
exact definition of purpose for which information is acquired 


2) Planning: 
purpose-oriented planning of investigation according to statistical principles with the 
objective to acquire optimum information at given expenses. In case of doubt and 
problems contact an experienced statistician. 


3) Performance: 
procurement of data strictly in accordance with the established planning 


4) Evaluation: 
summary and presentation of results; inference from sample to population 


5) Conclusions: 
realistic interpretation of results 


Essential basic principle: 

Statistical investigations will only provide truly efficient results if knowledge in the 
investigated field is optimally combined with thorough statistical knowledge. Usually this will 
lead to teamwork, since the investigator who lacks extensive knowledge in statistics is 
usually just as unsuccessful in his attempt to carry out the investigation on his own as the 
statistician who is entrusted with the entire problem complex. 
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8.2 Sample Surveys and Experiments 


Statistical investigations are subdivided into two groups sample surveys and experiments - 
which differ significantly with respect to their objective and interpretation. 


Experiment 

In an experiment response variables are investigated, which depend on various factors. The 
results are produced by a controlled variation of the factors of primary interest, whereas all 
other effects on the response have to be eliminated. 


Factors that are being studied may be quantitative (e.g. temperature, concentration) or 
qualitative (e.g. type of ample preparation, origin of samples). 


In an experiment, causal effects on the response variable can be determined quantitatively if 
the necessary provisions have been made in the planning and performance. 


Sample survey 

Samples are taken from one or several populations (the set of sample units is called sample 
of size n). Certain interesting characteristics are measured in order to evaluate 
(estimate)unknown characteristics of the populations or to compare populations. Conirary to 
the experiment, the results of a sample survey do not permit any conclusions regarding 
cause and effect. This is often ignored when correlations between two characteristics are 
assessed. 


Reliable results of both experiments and sample surveys can only be obtained if the 
respective statistical principles have been adhered to in the planning and performance of the 
investigation. 


8.3 Fundamental Principles in Statistical Investigations 


8.3.1 Experiments 
The five criteria of a good experiment are: 


1) The experiment should serve a well defined purpose 


In a first step the problem has to be clearly identified for each experiment, and the 
hypotheses to be investigated have to be determined. 


2) Factors. which are not of primary interest. should not influence the results 


Influencing factors, which are not included in the investigation, must be under control at 
fixed levels. Otherwise several effects become mixed up and cannot be separated by 
statistical methods. Systematic effects of factors, which cannot be controlled, are 
eliminated by random allocation of samples to treatments or factor levels and by random 
measuring sequence (randomization). 

Practical performance of randomization: In exampie A 1 we assign a number to each 
sample, No. 1 - 8 to tablets and No. 9 - 16 to beads. Each number is written on a leaflet 
and mixed in a box. The order of measurement is given by blind drawing of the leaflets 
from the box. 


3) The experiment should be free of systematic errors 


This requirement is partially connected with 2). Systematic errors, due to any change of 
effects in time, are eliminated by randomization. An often underestimated source of 
systematic errors is the prejudice of the experimenter. In order to avoid this, mainly 
“blind” experiments should be performed. The samples are coded with random numbers, 
the decoding key only known to a confidential person, who herself is not in a position to 
perform analyses. 


4) The experiment should provide a measure of its precision 


An estimate of the precision is obtained by a replication of the experiment. For each step 
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of the experiment several measurements should be taken. In example A 1 it would be 
impossible to state whether a systematic difference exists, if only one tablet and one 
bead are measured. (Appendix I) 


5) Precision and efficiency of an experiment should be high enough to reach the set goals 

There are various measures to improve the precision and efficiency of an experiment: 

e Reduction of variability by using homogeneous materials and by carefully controlling 
all the factors, as well as by strictly observing the analytical regulations. 

e Increasing the number of replications; this will lead to the problem of determining the 
sample volume required to achieve the desired precision. 

e Blocking: Measurements are performed in homogeneous groups (blocks). Such 
blocks may be ’measurements made by the same operator’ or ’measurements made 
on the same day’. 

A special blocking procedure is the paired comparison. In example A 3 both 
laboratories measure the compressive strength on samples drawn from the same 
cement bag. 

Block experiments should be balanced and randomized, i.e. the number of 
measurements and treatments is equal in each block and assignment of samples or 
treatments to blocks is random. 

Special cases of blocking: paired comparison. 

e Analysis of covariance: factors, which are not subject to the experimenter’s control, 
are recorded in order to eliminate their effects in a later analysis. 


The principles above are valid for any experimentation. Special designs for various problems 
are given in the literature. Problems may be: Effects of one or several factors ona 
response, additivity of factor effects, splitting of components of variability ina measurement 
procedure in order to achieve a prescribed precision at minimum cost, inter-laboratory tests, 
calibration, evaluation of systematic errors etc. 


Note: 


In contrast to a regression analysis with a given set of observations, a carefully designed 
experiment allows the evaluation of causal relationships. In regression the effects are 
usually mixed and can only be separated due to some mathematical model and not due to 
their real origin (ref. Interpretation of r,, section 7.1). 
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STRUCTURE OF AN EXPERIMENT 
1) Recognize problem 
2) Describe problem in detail and determine hypotheses 


3) Define experimental area; contact statistician; determine factors, levels and number of 
replications; elaborate exact analytical regulations; determine a reference basis if 
required. It is often necessary to analyse what happens if no treatment is applied (e.g. 
Placebo in clinical experiments). 


4) Establish experimental design, taking into account methods to improve precision 
(blocking). 
Randomization: 
- random selection of sample units 
- random allocation of treatments 
- coding of samples 


5) Specify variables, which cannot be maintained constant (covariables) 
6) Determine number of replications 
7) Determine procedural organization 


8) Determine procedures of statistical data analysis; exact regulation concerning the 
manner in which data are to be supplied (form) 


9) Perform experiment 

10) Evaluate results (statistical data analysis) 
11) Draw conclusions 

12) Take measures 


Step 2) to 7) are designated as experimental planning. The results of this planning should 
absolutely be recorded in an experimental plan. Since experiments are usually quite 
expensive, careful planning will definitely be worthwhile. 


The request to prepare a written experimental plan entails significant advantages: 
@ the necessity to formulate statements in precise terms 
¢ clear conditions and thus less uncertainty in the performance of the experiment 


¢ the circumstances of the experiment can be reconstructed, if later on the results are 
used for comparison with new results. 


8.3.2 Sample Surveys 

Sample surveys and experiments are performed in an analogous way. Differences are due 
to the different situation. While the purpose of an experiment is to directly produce results, 
sampling is carried out to analyse existing characteristics of a population. 


Special attention should be paid to the following points: 


1) Target population and sampled population 


The target population is the aggregate about which the investigator is trying to make 
inferences from his sample. It is usually helpful to focus the attention on differences 

between the population actually sampled and the population that is attempted to be 

studied. 
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2) Sample unit 
The sample units have to be described in clear terms. They should not overlap and the 


sum of all the sampie units must be equal to the investigated population. 
In the cement production for instance, if an individual sample is analysed, the results and 
interpretations are different from those of an analysis performed on a daily composite 
sample. 

3) Representativity and sampling method 
Samples only supply unbiased results if they represent the entire population. It is often 
mistakenly believed that the large sample size n ensures its representativity, or it is 
simply maintained that the sample is representative in order to prevent rejection of the 
results. 
Representativity can only be ensured by an appropriate, random sampling method, 
giving each sample unit an equal chance to be included in the sample (exceptions in 
special cases: sampling with unequal probabilities, systematic sampling). 


in literature a number of selection procedures are described. Depending upon the problem 
situation, various methods ma y be considered. They differ in their practicability, financial 
consequences (expenses) and efficiency. 


in most cases an optimum selection procedure for a specific problem situation can be found. 
STRUCTURE OF A SAMPLE SURVEY 
1) Recognize problem. What information is required about what populations? 


2) Detailed definition of problem 
- definition of target populations 
- what information with what accuracy? 


3) Determine the sample population. Where, when and how are the sample units extracted 
from the population? Determine the variables to be recorded 


4) Determine sampling method (equal probability sampling, stratified sampling, cluster 
sampling, multi-phase sampling, multi-stage sampling, systematic sampling, unequal 
probability sampling)\ 


5) Determine sample scheme (in accordance with the required accuracy) 
6) Outline procedural organization 


7) Determine procedures of statistical data analysis; exact regulation concerning the 
manner in which data are to be supplied 


8) Perform sampling 

9) If possible perform measurements in random sequence 
10) Evaluation and presentation 

11) Conclusions 

12) Take measures 


Ne 
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9. OUTLOOK 


This section will give a short outlook to further and perhaps more sophisticated statistical 
applications. It also contains references for further readings. All these applications require 
more than elementary knowledge of statistical methods. 


9.1 Time series and growth curves analysis 


This is the analysis of chronological observations, (e.g. daily temperatures at a certain 
iocation or observations over a certain time period of a patient obtaining a certain 
medicament). The characteristic of these data are that the observations are not independent 
from each other and the method of linear regression cannot be applied. 


Literature: 


Box, G.E.P. and Jenkins, G.M. (1976). Time Series Analysis, Forecasting and Control, 
second edition. San Francisco: Holden-Day, Inc. 


Cox, D.R. and Lewis, P.A.W. (1966). The Statistical Analysis of Series of Events. London: 
Methuen. 


Nelson, C.R. (1973). Applied Time Series Analysis for Managerial Forecasting. San 
Francisco: Holden-Day, Inc. 


9.2 Categorical and Qualitative Data Analysis 


This is the analysis of counts. Many experiments contain qualitative and nonmetric 
variables, e.g. when analyzing the number of smokers and non-smokers beyond male and 
female persons. Smoking behaviour and sex have no natural numeric values associated 
with them. Used statistical methods for such data are 


Crosstabulation 


Contingency tables 


+ 
+ 

¢ Goodness of fit tests 
¢ Log linear models 

@ Logistic regression 

@ Correspondence analysis 

Literature: 

Agresti, A. (1984). Analysis of ordinal categorical data. New York: Wiley-interscience. 


Bishop, Y.M.M., Fienberg, S.E. and Holland, P.W. (1975). Discrete Multivariate Analysis: 
Theory and Practice. Cambridge, MA: MIT Press. 


Haberman, S.J. (1978). Analysis of Qualitative Data, Voi. 1: Introductory Topics. New York: 
Academic Press. 


Haberman, S.J. (1979). Analysis of Qualitative Data, Vol. 2: New Developments. New York: 
Academic Press. 
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9.3 Experimental Designs and ANOVA 


This is a wide field containing among others: 
One-Way ANOVA 

Multifactor ANOVA 

Analysis of Nested Designs 

Full and Fractional Designs 

Response Surface Analysis 


Covariance Analysis 


--f-f ¢ © © @ 6 


and a lot more. 


Literature: 

Box, G.E.P., Hunter, W.G. and Hunter, J.S. (1978). Statistics for Experimenters. New York: 
Wiley. - 

Neter, J. and Wassermann, W. (1974). Applied Linear Statistical Models. Homewood, 
IItinois: Richard E. Irvin, Inc. 
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9.4 Multivariate Methods 


This title involves the analysis of multivariate data. It is not appropriate to analyze 
multivariate data univariate, because correlations among the different variables are not 
taken into account. 


Statistical Methods: 
Correlation Analysis 
Covariance Analysis 
Multiple Regression 
Principal Components 
Factor Analysis 
Discriminant Analysis 
Cluster Analysis 
Canonical Correlations 


Multidimensional Scaling 


“ef; + © Heh OHmhUh OrhlUCOrhlUChOhlhUO 


and a lot more. 
Literature: 
Sieber, G.A.F. (1984). Multivariate Observations. New York: Wiley. 


Flury, B. and Riedwyl, H. (1988). Multivariate Statistics. A practical approach. London, New 
York: Chapman and Hall. 


Johnson, R.A. and Wichern, D.W. (1982). Applied Multivariate Statistical Analysis. London: 
Prentice-Hall. 


Morrison, D.F. (1976, 2nd edition). Multivariate Statistical Methods. New York: McGraw-Hill. 


Everitt, B.S. (1980). Cluster Analysis, 2nd edition, London: Heinemann Education Books, 
Ltd. 
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9.5 Nonparametric Methods 


Parametric methods base on certain assumptions on the data (e.g. normality of residuals in 
linear regression, normality of the observations in testing etc.). If these assumptions do not 
hold, it is often more efficient to use methods that do not use a particular underlying 
distribution function, so called nonparametric methods (e.g. the Wilcoxon Test known for 
Section 5.2.4 is a nonparametric test). Some nonparametric procedures exist for: 


¢ Tests of binary sequerices 
Tests for randomness 


Tests for location 


+ 

+ 

¢ Comparison of 2 samples 
¢ Rank correlation analysis 
+ 


Goodness of fit tests 
Literature: 
Conover, W.J. (1980). Practical nonparametric statistics, 2nd edition. New York: John Wiley 
and Sons, Inc. 
Lehmann, E.L. (1975). Nonparametrics. San Francisco: Holden-Day, Inc. 
Gibbons, J.D. (1976). Nonparametric Methods for Quantitative Analysis. New York: Holt, 
Rinehart and Winston. 
Hollander, M. and Wolfe, D.A. (1973). Nonparametric Statistical Methods. New York: Wiley. 


9.6 Bootstrap and Jack-knife Methods 


In dealing with complicated functions of data, it is mostly not possible to derive the 
underlying distribution function. Bootstrap and Jackknife sometimes provide the possibility to 
derive the distribution functions, statistical measures and a lot more at such complicated 


data. 
Literature: 


Efron, B. (1982). The Jackknife, the Bootstrap and Other Resampling Plans. Philadelphia: 
Soc. for Industrial and Applied Math. 


9.7 Simulation and Monte Carlo Method 


By use of simulation and Monte Carlo methods it is often possible to investigate and analyse 
complex functions of random variables or systems of events. Simulation incorporates also 
the generation of random variables. 

Literature: 

Fishman, G.S. (1978). Principles of Discrete Event Simulation. New York: John Wiley & 
Sons, Inc. 


Rubinstein, R.Y. (1981). Simulation and the Monte Carlo Method. New York: John Wiley & 
Sons, Inc. 
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9.8 General Literature 
Seber, G.A.F. (1984). Multivariate Observations. New York: Wiley. 


Belsley, D.A. Kuh, E. and Welsh, R.E. (1980). Regression Diagnostic: Identifying Influential 
Data and Sources of Collinearity. New York: Wiley. 


Chambers, J.M., Cleveland, W.S., Kleiner, B. and Tukey, P.A. (1983). Graphical Methods 
for Data Analysis. Boston: Duxbury Press. 


Flury, B. (1980). Construction of an asymmetrical face to represent multivariate data 
graphically. Tech. Rep. No. 3, University of Berne, Dept of Statistics. 


Schupbach, M. (1984). ASYMFACE - Asymmetrical Faces on IBM-PC. Tech.Rep. No.16. 
University of Berne, Dept of Statistics. 


Draper, N.R. and Smith, H. (1981), 2nd ed.). Applied Regression Analysis. New York: Wiley. 


Grant, E.L. and Leavenworth, R.S. (1980). Statistical Quality Control, fifth ed. New York: 
McGraw Hill. 


Montgomery, D.C. (1985). Introduction to Statistical Quality Control. New York: Wiley. 
Recommended literature pp. 90 ff 

Huff, D. (1974). How to lie with statistics. 

Miller, R.G. (1981). Simultaneous Statistical Inference, 2nd ed. New York: Wiley. 
Morrison, D.F. (1983). Applied Linear Models. Englewood Cliffs, NJ: PrenticeHall Inc. 


10. STATISTICAL PROGRAM PACKAGES 


Very complete and sophisticated packages are (all programs are available for PC’s under 
DOS): 


BMDP Statistical Software Ltd., Cork Technology Park. Cork, 
Ireland (phone: 021-542722) 
SAS: SAS Inst. GmbH, Cary NC, USA 
. (phone: (919)467-8000) 
SPSS SPSS Inc., Chicago Il, USA (phone: (312)329-3300) 
SYSTAT SYSTAT Inc., Evanston Il, USA (phone: (312)864-5670) 


STATGRAPHICS STSC Inc., Rockville MD, USA (phone: (301)984-5000) 
A review of 49 statistical packages is given in the PC magazine, Vol. 8, No. 5, March 1989. 
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Appendix | 


Example Ai: 
In the application of XRF-analysis of raw meal it was required to know to what extent the 
results are dependent on sample preparation, by pressed powder tablets and by fusion with 


Litetraborate, respectively. 
With 8 preparations of tablets and beads of the same raw meal the following results for CaO 
and Si02 were obtained. 


cao 


We are interested in differences between the preparation methods. To get a quick survey on 
the data we mark every observation on the measurement scale for tablets and beads 


respectively. 






sion 
[tabiets | beads | 










Tablets 
SiO. 
Beads 
13,6 13,8 14,0 14,2 
Tablets 
CaO 
Beads 
42,2 42,4 42,6 42,8 43,0 
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Looking at the graph we suppose a significant difference of CaO-results between 

beads and tablets. For Si 02 results no difference is visible. 

The final decision whether the differences are significant is made with an appropriate test. 
Test situation: 


¢ We test the hypothesis that the mean results of both preparation methods are the same 
Ho: Btablets = Ubeads 
¢ Two independent samples with small sample sizes N = 8 


¢ two-sided test: we have no idea in what direction a possible difference may occur before 
the observations are taken. 


Test procedure: Wilcoxon-test at a = 5% level 


As an alternative representation we can mark the data on both side of one line. In this case 
ranks of the observations can be read directly from the graph. 


Tablets 
CaO 
Beads 
42,2 42,4 42,6 . 42,8 43,0 
1 3,67 8,5 
Ranks 4, N 


The ranksum of beads is R = 45.5 
W=2.45.5.-8.17=-45 
IW! = 45 => 38 = Woo7s (from Table A-8, Appendix III). 
Decision: 
The difference of CaO results between tablets and beads is significant. 


Tablets 
SiO, 
Beads 
13,6 13,8 14,0 14,2 
Ranks 3459 05 4 15 


Wl=1<38= Woo7s 


There is no significant difference for SiO. results. 
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Example A2: 

Control of homogenization efficiency. 
In intervals of 30 minutes, 20 samples are taken from the raw ma serial stream both before 
and after homogenization. 

Measured values: CaCO; content 

Total capacity of homogenization silo: 1000 tons 


homogenization homogenization 
jae 
207 dO 
3 
76.8 
76.9 
. 76.8 
[7 os 
eo 
foo 
77.2 
aaa 
m1 id 


77.1 76.9 
76.9 76.9 
76.7 77.0 
77.0 
76.0 
[e088 re 


The graph is chronological order (time-plot) shows a systematical variation of CaCO3 
content before homogenization. Except for one outlying value the variation is small after 
homogenization. 
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Before Homogenization 


before homogenization 


CaCO, % 





o 5 10 15 20 
sample number 


After Homogenization 


after homogenization 


outlier 





v 5 10 BL 20 
sample number 
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Using the Dixon criterion to check for outliers we compute 


r= Xia) — Xi _ 76.8-—76.0 _ 
* Xn-2)- Xa 77.0-76.0 


with x,,) the smallest, X(g) the third smallest and X(,2) the largest value. 


Since too = 0.8 > to.995 = 0.562 (Table A-2, Appendix III) the extreme value is considered to 
be a real outliner (with significance level a = 1%). It is deleted for further analysis. 


Statistical data description 






CaCo, before 
homogenization 












Xm 1775 id.) 7 ——s—idzr 
[xmin [76.0 | (76.0 76.8 
rR 4. 






Comment: 


Mean and median are almost equal. The distribution of observations seems to be therefore 
symmetrical about the mean either before and after homogenization. Due to homogenization 
the standard deviation is reduced from 0.36 to 0.08 corresponding to a factor of four to five. 


Note: 
Elimination of the outliner reduces the standard deviation from 0.22. to 0.08 ! 
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Example A3: 
In order to control mortar strength a national control laboratory takes every month a sample 
of cement in a plant. It was supposed that these control results tend to be lower than results 
of internal quality control. 

To find out a suspected syste ma tic error between results of control office and plant 
laboratory during a year the samples were measured in both laboratories: 

Control y | Difference 
d=x- 









January 
February 
March 
Apri 

May 

June 

July 
August 
September 
October 
November 
December 
The strength values are paired. Each pair of observation is concerned with the same 
sample. To test for a significant difference we use the singed-rank test at a significance 
level of a = 5% (two-sided) 















Ho: w= 0 


Plot of differences d; = x; - y; 





Difference -20 0 «ao 49 60 80 


sianed rank -6 = -2 1345 78 9 "1 12 


Sum of signed ranks: 
T = (-6) + (-2) + 1+ 3+ 4+ 5+ 7+ 8+ 94+ 10+ 114 12 = 62 
Since T = 62 > 52 = To97s5 we decide that the difference between results is systematic. 
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Example A4: 
The following data represent 115 measurements of Schmidt-haamer-strength (abutment of 
a bridge). 





A detailed interpretation of these data requires their representation in a frequency table and 
histogram. With the probability paper we check the data for normality. 


The cumulative frequency curve plotted on the probability paper is not linear and the 
distribution is therefore not normal. 


lil ' $ { i ‘ 
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Histogram and tally show that even values of Schmidt-hammer strength are more frequent 
than odd values. This seems to be caused by a reading error of scale. Probably only even 
values are marked on the measurement scale. 
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Table A-1 Standard Normal Distribution - Values of P 





Values of P corresponding to z, for the normal curve. 


z is the standard normal variable. The value of P -z, equals one minus the value of P for +2,, 
e.g. the P for -1.62 equals 1-0.9474 = .0526. 
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Yoooe 
abnuao 


15 
1.6 
17 
8 
1.9 

0 

1 
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Table A-2 Critical values for the Dixon Criterion 


Upper Percentiles 


Number of 
Obser- 
vations, 





a NI 
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Table A-3 Critical values for Outlier-tests, (large sample size n) 





2,95 | 3,01 | 3,10 | 3,18 | 3,29 | 4.32 | 4,49 | 4,63 | 4.79 | 4.91 20 
3,22 | 3,27 | 3.37 | 3,46 | 3,58 | 4,70 | 4.89 | 5,06 | 5,25 | 5,39 30 
3,41 | 3,46 | 3,57 | 3,66 | 3,79 | 4.96 | 545 | 5,34 | 5,54 | 5,69 40 

50 


3,57 | 3,61 | 3,72 | 3,82 | 3,04 | 5,15 | 5,35 | 5,54 | 5,77 | 5,91 5 

3,69 | 3,74 | 3,85 | 3,95 | 4,07 | 5,29 | 5.50 | 5,70 | 5.93 | 6.09 60 
3,88 | 3,93 | 4,05 | 4,15 | 4,27 | 5,51 | 5,73 | 5.93 | 6.38 | 6,35 80 
100 | 4,02 | 4,09 (4,20 / 4,31 | 4,44 } 5,68 | 5,90 | 6.11 | 6,36 | 6.54 | 100 


SSS ES 


150 | 4,30 | 4,36 | 4,47 | 4,59 | 4,72 | 5,96 | 6,18 | 6,39 | 6,64 | 6,84 150 
200 | 4,50 | 4,56 | 4,67 | 4,78 | 4,90 | 6,15 | 6.38 | 6,59 | 6.85 | 7,03 } 200 
500 | 5,06 | 5,13 | 5,25 | 5,37 | 5.49 ] 6.72 | 6.94 | 7,15 | 7,42 | 7,60 | 500 
1000 | 5,50 | 5,57 | 5,68 | 5,79 | 5,92 | 7,11 | 7,33 | 7,54 | 7,80 | 7,99 | 1000 


Table A-4 Critical values of the t-distribution 
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3.078 651 
1.886 
1.638 "841 
1.533 "604 
1.476 "082 
1.440 107 
1.415 499 
1.397 355 
1.383 
1.372 3.169 
1.363 3.106 
1.356 3.055 
1.350 3012 
1.345 2.977 
1.341 2.947 
1.337 2.921 
1.333 2898 
1.330 2.878 
1.328 2 861 
1.325 2845 
1.323 2.831 
1.321 2.819 
1.319 2.807 
1.318 297 
1.316 2.787 
1.315 2.779 
1.314 2.771 
1.313 2.763 
1.311 2.756 
1.310 2.750 | 
1.308 2.704 | 
1.296 2. 660 
1.289 2617 
1.282 2.576 
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Table A-5 Critical values of the x’-distribution 


3.84 
5.99 
7.81 
9.49 
1.07 


12.59 
14.07 
15.51 
16.92 
18.31 


1.24 
1.69 
2.18 
2.70 
3.25 


19.68 
21.03 
22.36 
23.68 
25.00 


SBlses 


26 .30 
28.87 
31.41 
36.42 
43.77 


55.76 
78.08 
146.57 





For large degrees of freedom, 
xp = Uzp + V/2m— 1)? approximately, 
where m = degrees of freedom and z; is given in Table A-1 


© Holderbank Management & Consulting, 2000 Page 541 


“Holderbank” Cement Seminar 2000 CN TTI ATS 


Materials Technology II - Statistics 


Table A-6 Values of ,.,. for confidence limits (Range method) 





7 : " 
m™ 3% | 2.5% | i% | 05% | 0.1% | 0.05 % 





2 3.157 6,353 | 15.910 | 31,828 (159,16 | 328,31 
3 0,885 1,304 2,111 3,008 6.77 9,58 
4 029 0,717 1,023 1,316 2,29 2,35 
3 mS) »007 0,685 0,343 1,32 1,38 
8% 0,312 0,399 0,523 0.628 0,92 1.07 
7 263 a3 429 307 71 0,82 
8 7230 288 366 429 aa 87 
9 yo05 yaa yan et A ve 


10 | jtsc | 230 | ss | 233 | tt | lO 
11 [| 0170 | 0,210 | o2c2 { @sne | o40 | O48 


12 ,138 194 241 207 36 40 
13 liz sl 224 256 ,30 a7 
14 138 170 209 ,239 31 at 
13 ,131 180 ,197 294 29 32 
16 0,124 0,151 0,186 212 27 0,30 

7 8 144 177 201 26 28 
183 3 »137 168 ,191 2 28 
19 108 131 161 .isg 23 25 
oy 104 126 SS 175 2 24 





2 Nach E. Lorp. The use of range in place of standard deviation in the ¢-test. 
Biometrika 34, 1947, S. 41. 


SPD SOC 
Page 542 © Holderbank Management & Consulting, 2000 


; 
“Holderbank” Cement Seminar 2000 


Materiais Technology II - Statistics 


Table A-7 Critical values of the signed-rank test 


number of differences 
not equal zero: m 
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Table A-8 Critical values of the Wilcoxon-test 


size of smaller sample ny 
SPER cERreeetce eee 2 | 


110 | 114 126 
105 {| 130 
102 } 106 


Values of Wo 95 



















size of smaller sample ny 


Values of Wo .975 
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Fy 975 


Table A-9 (continued) Critical values of the F-distribution 


Gases q2s=3 3 


52 FSS 
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ee 
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lay 


8.51 
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STR SO 
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Seeded etesescitt 
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ist 


986. ty | 963.1 | go72 08g 
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39.4 
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Recommended Literature 


The following list gives a selection of books on applied statistics for further reading. The list 
is restricted to books which are easily comprehensible for users without profound knowledge 


in mathematics. 
C. Chatfield 


Noether 


M.G. Natrella 


M.R. Spiegel 
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"Statistics for Technology" Chapman and Hall, London 
(1975), 350 pages 

General survey on statistical methods with good 
comments on the interpretation of statistical results. 
Special chapters on regression, design of experiments 
and quality control 

"introduction to Statistics: A Fresh Approach" Houghton 
Mifflin Company, Boston (1971) 230 pages 

Modern statistical methods, especially for the analysis of 
experiments. Good explanation of basic statistical ideas 
and problems based on intuition. 

"Experimental Statistics" US Department of Commerce, 
NBS Handbook 91 (1963) 

"Cookbook" with many test procedures and designs of 
experiments. Often it is somewhat difficult to know what 
procedure has to be chosen. 

"Theory and Problems of Statistics" Schaum’s Outline 
Series,(1961) New York 350 pages 

Definitions and procedures of classical statistics 
accompanied by many examples (875 solved problems) 
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